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HUMAN AMINOPEPTIDASE P GENE 
Cross Reference to Related Application 

This application claims benefit of U.S. Provisional Application No. 
60/057,854, filed September 2, 1997. 
5 Background of the Invention 

Evidence of an aminoacylproline hydrolase was first encountered in 
studies of the metabolism of bradykinin (BK). It was found that BK (Arg- 
Pro-Pro-Gly-Phe-Ser-Pro-Phe-Arg) is inactivated virtually quantitatively 
during a single passage through the rat pulmonary vascular bed (1,2). BK is 

1 0 degraded through 5-8 half-lifes during the 2-3 sec required for a single transit 
from the right to the left side of the heart (31). If, as appears to be the case, 
the overall metabolism occurs within the pulmonary capillary bed (mean 
transit time of about 0.2 sec), the half-life of BK within the capillary bed is 
on the order of 0.03 sec. From these data, it was postulated that the relevant 

1 5 kininase enzymes are situated on, or near, the luminal surface of pulmonary 
endothelium so as to have access to intravascular substrates (1,2,4). In time, 
it was shown that angiotensin converting enzyme (ACE) plays a central role 
in the inactivation of BK and is, in fact, disposed on the luminal surface of 
pulmonary microvascular endothelium (5,6). ACE was found to account for 

7 8 

20 one of the hydrolytic reactions (cleavage of the Pro -Phe bond) observed in 
the earliest studies (2). 

The original data indicated that a peculiar aminopeptidase also 
participated in the degradation of BK (1,2). The result caused some concern 
and confusion in that none of the aminopeptidases then known was capable 

25 of hydrolyzing an imido bond (Arg'-Pro 2 ). Shortly thereafter, an 

aminoacylproline hydrolase was isolated from an extract of E. coli and was 
shown to be capable of hydrolyzing polyproline and the Arg'-Pro 2 bond of 
BK (204,205). The bacterial enzyme was named aminopeptidase P, a name 
now used for aminoacylproline hydrolases obtained from any animal or plant 

30 source. 

Shortly after the discovery of E. coli AmP, it was found that pig 
kidney extracts contained a particulate-associated AmP activity and that the 
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AmP-like substance was not solubilized by detergents (78). The AmP-like 
material, believed to have been solubilized in a butanol/aqueous solvent 
system, behaved like a complex mixture of substances on chromatography. 
As a further complication, pig kidney AmP did not hydrolyze polyproline, 
5 the substrate used to assay E. coli AmP. A weakly reactive synthetic 
substrate was prepared, Gly-Pro-Hyp, and AmP activity was measured in 
terms of the rate of formation of free glycine in a two-step assay protocol 
(78). 

Membrane-associated AmP remained effectively inaccessible to 
10 conventional chemical and biochemical analysis until the early 1990's. The 
difficulties of AmP catalytic assay was solved by preparing the synthetic 
substrate Arg-Pro-Pro-[ 3 H]benzylamide (APPBz- 3 H) (20 Ci/mmol), a 
substrate related to the N-terminal tripeptide of bradykinin (21,22). 
APPBz- 3 H proved to be highly reactive with AmP and could, by virtue of its 
1 5 high specific radioactivity, be used under conditions of first order enzyme 
kinetics. However, the problem of the inefficient solubilization of 
particulate-associated AmP was not solved, and a search for soluble forms of 
AmP was therefore conducted. Guinea pig serum was found to be an 
enormously rich source of AmP (21,22) and was used as the starting material 
20 to obtain apparently homogeneous AmP in two isoforms, Mr 89,000 and 
81,500 (22). 

Independently, Hooper et al (1 1 1) solved the problem of solubilizing 
pig kidney AmP. They found that AmP is bound to membranes via a 
glycosyl phosphatidylinositol (GPI) lipid anchor and can be solubilized 

25 efficiently using phosphatidylinositol-specific phospholipase C (PI-PLC). 
Subsequently, Simmons et al (180) and Ryan, et al. (32) showed that rat and 
bovine lung and guinea pig lung and kidney forms of AmP are solubilized by 
PI-PLC. Human kidney AmP is also solubilized by PI-PLC. Once thus 
solubilized, AmP no longer behaves anomalously on conventional 

30 chromatography matrices. 

Aminopeptidase P (AmP; EC 3.4.1 1.9) is the only known human 
enzyme capable of hydrolyzing a N-terminal imido bond, a bond common to 

2 
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many collagen degradation products and some neuropeptides, cytokines and 
vasoactive peptides (14,16,21,22,31,98,1 11,146,147,152,165,192,205). 
AmP occurs in cell membrane-bound and intracellular soluble forms and is 
not uniformly distributed among tissues nor among cell-types of a given 
5 tissue (21,39,165,205), which implies that physiologic roles of AmP are 
determined by anatomic disposition (a determinant of reaction conditions 
and access to substrates) as well as by catalytic selectivity. 

It is therefore an object of the present invention to help define both 
molecular and anatomic determinants of AmP functions. 
10 Summary of the Invention 

The genomic DNA and full-length cDNA sequence of human kidney 
AmP has been determined. The deduced amino acid sequence indicates that 
AmP is a member of the recently-recognized "pita bread-fold" protein 
family, a family of very little sequence homology but of high similarity in 
1 5 three-dimensional structure (59). Within the "pita bread-fold" family, there 
is a subdivision called the "proline peptidase" family, with which human 
kidney AmP shares at least five short blocks of amino acid sequences of fair 
to high homology (although overall homologies are low). These blocks are 
known to contain the amino acid residues that compose the catalytic site of 
20 E. coli methionine aminopeptidase, a metallo-peptidase whose structure has 
been determined by x-ray crystallography (59). Based on these comparisons, 
it is postulated that human kidney AmP amino acid residue H430 serves as 
the proton shuttle, and D450, D461, H520, E555 and E569 (see SEQ ID 
NO:2) are the catalytic metal ligands. This can be tested by preparing the 
25 site-specific mutants H430F, D450N, D461N, H520F, E555Q and E569Q. 
In addition, each of five potential N-glycosylation sites and each of five C 
residues can be mutated to examine for indirect effects of glycosyl groups 
and disulfide bonds on catalytic activity, solubility and protein stability. In 
addition, one can determine the chromosome location of AmP. 
30 Using the sequence and immunocytochemistry at the level of electron 

microscopy (EM), one can define, in major organs, the cellular and 
subcellular sites of AmP, and, using subcellular fractions, dispositions of 
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AmP in terms of anatomically proximate receptors and cell signaling 
molecules (the bradykinin B2 receptor, eNOS and guanylate cyclase) whose 
activities may directly or indirectly be affected by AmP activities. 

This will help characterize structure-function relationships of human 
5 AmP at three levels: 1. molecular structure/catalytic activity, 2. cellular and 
subcellular distributions that determine orientations (and access to substrates) 
of the catalytic site, and 3. disposition in respect to "nearest-neighbor" 
effector and cell signaling molecules. 

In addition to providing conceptual advances in understanding of 
10 AmP functions, this work provides tools (antibodies and oligonucleotide 
probes) useful for clinical studies of AmP deficiency states. 

Use of the cDNA, genomic DNA, or a combination, for protein 
expression has commercial implications. The inferred amino acid sequence 
can be used as a starting point for defining higher structure and function. 
1 5 Through protein expression, crystals can be prepared for determination of 
higher structure. Reverse transcriptase-polymerase chain reactions was used 
to obtain four overlapping fragments of AmP cDNA. The intact full-length 
cDNA can be obtained by ligation. The first (nt 1-474) and second (359- 
734) fragments are digested with Xmnl (nt 365) and then ligated. The 
20 product (1-734) and the third fragment (634-1702) are digested with SacI (nt 
652) and ligated to yield 1-1702; which, with the fourth fragment (1588- 
3428), are digested with Seal (nt 1625) and ligated to yield 1-3428. DNA 
encoding human AmP can also be produced by direct synthesis of 
appropriate oligonucleotides based on the disclosed amino acid and 
25 nucleotide sequences. For large scale protein expression, the full-length 
DNA is transferred into the expression vector pVL1393 and used with co- 
transfectant, Baculogold, in the baculovirus/Sf9 insect cell system. This 
system has the capacity to produce recombinant AmP in the amounts needed 
for x-ray crystallography. Knowledge of cellular and subcellular sites of 
30 AmP will be predictive of the consequences of specific peptidase deficiency 
or inhibition. Membrane-bound forms appear to be disposed as 
ectoenzymes, which can be verified by EM immunocytochemistry. Soluble 

4 
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AmP is believed to be disposed in as yet unknown intracellular sites. Actual 
dispositions can be determined as a means of defining functional roles of 
AmP: AmP disposed in the endoplasmic reticulum of, for example, 
lymphocytes is expected to have functions and reaction conditions different 
5 from ectoenzyme forms disposed on renal proximal tubule and small 

intestine brush border epithelia and different yet again from AmP disposed 
on the luminal surface of vascular endothelium. 

Oligonucleotide probes and primers can be used to identify patients 
with homozygous or heterozygous AmP deficiencies. Primers can be used to 
1 0 examine for faulty AmP mRNA. Two pediatric patients with apparent 
homozygous deficiencies have been identified, at least one of which was 
mentally-retarded, epileptic and microcephalic. Early gene therapy could 
moderate any central nervous system injuries attributable to the lack of AmP, 
if administered early enough. Prenatal diagnosis of an AmP deficiency state 
1 5 would help decision making by parents and health care providers. 

As a member of the so-called "pita bread-fold" protein family, human 
AmP has a recognizable putative proton shuttle and five putative metal 
ligands. With molecular modelling, and expressed protein, one can design 
inhibitors of AmP. Since AmP inactivates the blood pressure-lowering 
20 oligopeptide bradykinin, inhibitors of AmP could be useful as 

antihypertensive agents. Bradykinin is reported to be antimitogenic and 
antiatherogenic. Thus, inhibition of AmP (and concomitant preservation of 
bradykinin) should be useful in preventing or limiting arterial stenosis or 
restenosis and development of atherosclerosis. By similar means, the 
25 structure of AmP can be used to design synthetic substrate, which in turn can 
be used to develop diagnostic assays based on AmP catalytic activity. These 
substrates and others will be of value, along with recombinant AmP, for 
screening of drugs designed to inhibit AmP. 

Since AmP is a protease capable of hydrolyzing N-terminal imido 
30 bonds it should be useful in degrading industrial protein feedstocks to free 
amino acids, and in breaking down wastes that have significant protein 
content, especially proline-rich collagenous protein wastes (wastes that are 
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otherwise resistant to degradation by better-known enzymes such as trypsin 
and chy mo trypsin). In so-called intestinal malabsorption syndromes, 
patients are sometimes given encapsulated digestive enzymes to improve 
breakdown of foodstuffs. AmP should be a beneficial additive to the mix of 
5 encapsulated enzymes to facilitate breakdown of proiine-rich peptides. 

Human AmP cDNA and genomic DNA can be used for designing 
antisense oligonucleotides, which may, in turn, be useful in patients having a 
surplus of AmP that, for example, contributes to arterial stenosis or 
restenosis or that contributes to development of atherosclerosis. By analogy 

10 with uses of AmP inhibitors, some downward modulation of AmP activity 
via use of antisense nucleotides might provide antihypertenstive effects. 

There are now some highly reliable computer programs that can 
identify peptide sequences within the primary structure of a protein that are 
likely to be immunogenic. Such programs can be used to identify 

15 immunogenic sequences within the inferred human AmP structure. Thus, 
knowledge of the nucleotide sequence of human AmP cDNA and genomic 
DNA can lead to the design of synthetic "epitopes" and preparation of highly 
specific polyclonal and monoclonal antibodies. Antibodies are useful in the 
development of immunoassays having diagnostic uses. Alternatively, 

20 recombinant expression of AmP protein clearly provides an appropriate 
antigen for preparing specific antibodies to AmP. 

Human AmP cDN A and genomic DNA can be used to develop 
transgenic animal models and can be used, under low stringency conditions, 
to clone AmP cDNAs and genomic DNAs of other animal species. By the 

25 latter means, knockout animal models can be prepared and provided 

commercially to other investigators. The AmP cDNA and genomic DNA 
can also be used to prepare stable transformants that can be provided 
commercially to other investigators. With knowledge of the AmP DNA 
sequence and its coding for putative critical amino acid residues of the 

30 catalytic site, mutants can be prepared to modulate catalytic activity. 

Similarly, unglycosylated, truncated forms of AmP can be expressed that are 
catalytically active but more amenable than wild-type AmP to crystallization. 

6 
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Such forms should be highly useful to drug design firms. 

The DNA of a functionally related enzyme, angiotensin converting 
enzyme (ACE), is known to be polymorphic, and one form is associated with 
high levels of serum ACE. Human AmP cDNA and genomic DNA can be 
5 used to examine for polymorphisms, which, if found, can be further studied 

for functional impacts. 

Brief Description of the Drawing 

Figure 1 is a comparison of the amino acid sequence of human 
aminopeptidase P to porcine amino peptidase P. 
10 Detailed Description of the Invention 

History of Isolation and Physiological Roles of AmP 

Using guinea pig serum AmP as immunogen, mouse polyclonal and 
then monoclonal antibodies, which were found to bind guinea pig and rat 
lung and kidney forms of AmP at high affinity, were prepared. One of the 

1 5 monoclonal antibodies, HL5 1 0, also binds human AmP (32). The anti-AmP 
preparations have proved to have many uses and have been particularly 
helpful in immunoaffinity chromatography. Immunoaffmity 
chromatography has substantially simplified the task of purifying AmP and 
yields apparently pure AmP in the mole quantities needed for structure 

20 studies. HL5 10 has also been used for light microscopy 

immunocytochemistry. In guinea pigs, spleen, kidney, liver, lungs and small 
intestine are particularly rich sources of catalytically-active immunoreactive 
AmP. 

Extensive amino acid sequencing using guinea pig lung and kidney 
25 forms of AmP were performed. Protein and cDNA databases were searched. 
It was found that guinea pig AmP contains at least three of six blocks of 
highly conserved sequences characteristic of a recently recognized group of 
proteins called the proline peptidase family. The match of primary structures 
appears to have functional significance in that all family members (e.g. 
30 human proline dipeptidase) are, like mammalian AmP, capable of 

hydrolyzing imido (as opposed to amido) bonds. The conserved blocks 
provided a simple guide for cloning AmP cDNA because one could then 
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specify, in terms of block placements (e.g. block C within the middle of 
AmP and block F near the C-terminus), pairs of primers that would yield 
large or small stretches of cDNA. 

Recently, pig kidney cortex AmP has been sequenced almost 
5 completely (Edman degradation and some mass spectroscopy) (196). In 
addition, AmP and the entire proline peptidase family have been postulated 
to be members of a larger protein family ("pita bread-fold" family) not 
characterized by common functions but by highly similar 3 -dimensional 
structures. Using these new findings and our data, human kidney AmP 
10 cDNA was clone. Unexpectedly, the primers prepared originally to 

correspond with guinea pig lung AmP sequences worked as well with human 
kidney mRNA. 

In addition, the database searches made evident that AmP has a 
clinical relevance greater than previously supposed. A Medline search of the 

1 5 biochemistry of human proline dipeptidase (PDP) raised several apparently 
relevant issues. For example, PDP deficiencies are well-documented and 
appear to be caused by several different gene defects, including single base 
mutations and inappropriate splicing (83,84,187,188). Both protein-positive 
and protein-negative PDP deficiency states have been described. It is 

20 therefore believed that the genetics of PDP deficiencies will provide a guide 
for searches for AmP deficiencies. It was also found that AmP deficiencies 
have been reported and can have clinical expressions like those seen in PDP 
deficiencies. Blau et al (62) found two boys of consanguineous parents who 
excreted in urine a mixture of proline-containing oligopeptides, including 

25 Gly-Pro, a dipeptide excretory product characteristic of PDP deficiency. 
However, the boys excreted in greater amounts (up to 30 mg/day) a 
tetrapeptide, Gly-Pro-Hyp-Gly, not seen in urines of normal subjects nor 
patients with PDP deficiencies. It was determined that the excreted 
tetrapeptide has a sequence identical to the N-terminal tetrapeptide of a 

30 putative hormone called antiarrhythmic peptide (AAP) (Gly-Pro-Hyp-Gly- 
Ala-Gly) (51-56). In the characterization of guinea pig serum AmP (21,22), 
it has been found that AmP binds AAP at high affinity in vitro. Gly-Pro- 

8 
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Hyp-Gly is among the commonest tetrapeptide sequences of collagen, and its 
excretion in urine of an AmP-deficient patient may reflect failure in late 
stage collagen metabolism and amino acid conservation. 

In addition, Blau et al (62) found that an intestinal biopsy sample 
5 obtained from one patient contained saccharase and PDP activities within 
the normal range but did not contain AmP catalytic activity (less than 2% of 
the normal mean). Both of the patients were mentally-retarded. One, in 
addition, had microencephaly and epilepsy. Mental retardation is also a 
characteristic of PDP deficiency (84,187,188). It may be relevant to both 
10 deficiency states that AmP and PDP occur in rat cerebral cortex, largely in 
association with astrocytes (98,147,148). It may also be relevant that 
astrocytes appear to 'guide 1 vasculogenesis in retina (30) and perhaps in other 
parts of the central nervous system. AmP-related mental retardation may 
result in part from deficient vasculogenesis in early development of the 
15 central nervous system. 

In rat lungs, AmP and ACE together account for all of the 
bradykinin-inactivating activity. This result initially appeared to be 
inconsistent with the fact that five of the eight peptide bonds of BK are 
hydrolyzed during passage through the rat pulmonary vascular bed (1,2,14). 
20 However, biologically-inert metabolic fragments of BK are not, a priori, 
invulnerable to proteolytic attack. Indeed, one pulmonary endothelial 
peptidase, dipeptidyl peptidase IV (DP IV), cannot hydrolyze BK (124,199) 
but can rapidly degrade the BK fragments formed by AmP (e.g. des- 
Arg'-BK) to release Pro-Pro. Pro-Pro is a BK metabolite. It thus appears 
25 that three of the five hydrolytic reactions occur after BK has been inactivated 
by AmP and/or ACE. Recently, Simmons and colleagues (120) have 
confirmed these findings and have shown that combined inhibition of ACE 
and AmP has profound blood pressure lowering effects in renin-related 

hypertension (119). 
30 There are large interspecific differences in distributions of AmP. 

AmP occurs in abundance in guinea pig and rat kidneys and lungs but is 
virtually absent from rabbit and cat kidneys and lungs (16). In fact, human 
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tissues were found to have AmP in relatively high abundance (Table 1). By 
Northern blot analysis, human kidney, liver, small intestine, heart, lung, 
colon and placenta are particularly enriched in AmP mRNA (47). 

Table 1 Relative abundance of aminopeptidase P mRNA in various 

human tissues 

Tissue Relative intensity 

Kidney ioo 
Lung 32 
Heart 42 
Placenta 16 
Liver 55 
Small Intestine 55 
Colon 21 
This information can be used to a) relate molecular structure to AmP 
catalytic activity, b) define its cellular and subcellular dispositions so as to 
clarify orientations of the catalytic site, and c) define the anatomic 
relationships of AmP to functionally-related "nearest-neighbor" effector and 
cell signaling molecules. 

The underlying hypothesis is that roles of AmP in systemic 
biochemistry are likely to be determined by reaction conditions, access to 
substrates and responses of "nearest neighbors" (all set by anatomical 
relationships) as well as by catalytic selectivity. Thus, AmP disposed near 
cell matrix may be well-positioned to participate in secondary, tertiary or 
higher stages of collagen metabolism, AmP disposed on intestinal brush 
border epithelium likely functions as a specialized digestive enzyme, AmP 
disposed on renal proximal tubule epithelium plausibly participates in 
conservation of proline, AmP in neuronal tissues may process neuropeptides, 
and AmP disposed on vascular endothelium processes circulating peptide 
hormones such as bradykinin (31,34,39,181). It has been suggested that 
soluble forms of AmP disposed in platelets and lymphocytes may act to 
modulate effects of cytokines and peptides that mediate acute inflammation 
(165,191,192,205). 
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As noted above, two pediatric patients with AmP deficiencies have 
been identified, both mentally-retarded (62). Whether mental retardation can 
be attributed to AmP deficiency is not yet clear, but the possibility should be 
testable in that the cloned human AmP cDNA provides a guide for preparing 
5 AmP knockout mice. Similarly, there is a basis for blocking AmP 
expression in rats by use of antisense AmP oligonucleotides. Rat AmP 
cDNA can be cloned using human AmP cDNA or fragments, as a probe. 

Both AmP-deficient patients excreted oligopeptides having N- 
terminal Xaa-Pro- residues, and these peptides (most notably Gly-Pro-Hyp- 
10 Gly) may most directly reflect the AmP deficiency state. AmP should 
degrade the latter peptide to form Gly plus Pro-Hyp-Gly. The latter is a 
substrate for DAP IV, and the expected dipeptide, Pro-Hyp, is a substrate for 
proline dipeptidase (PDP). Normally, human renal proximal tubule contains 
AmP, DAP IV and PDP in abundance (1 10), and the three enzymes may 
1 5 constitute a cascade of reactions important for amino acid conservation. 

Homozygous AmP deficiencies are probably rare. Partial AmP 
deficiencies may be relatively common, a possibility that has been suggested 
vis a vis a side effect of angiotensin converting enzyme (ACE) inhibitor 
therapy (31): ACE inhibitors are widely-used for the treatment of 
20 hypertension and congestive heart failure (93). Most patients experience 
few, if any, side effects. However, a small percentage of patients develop 
urticaria and angioedema (99), problems that can also occur when bradykinin 
is infused i.v. in relatively high doses (66). It appears that AmP is normally 
the last defense against the entry of BK into the systemic arterial blood of 
25 patients treated with ACE inhibitors (3 1). Clearly, patients with a relative or 
complete AmP deficiency could be at exceptional risk if treated with an ACE 
inhibitor. When angioedema affects tissues of the upper airway, thereby 
obstructing air flow, death can occur within minutes. Therefore, even though 
angioedema is an uncommon side effect of ACE inhibitors, it would be 
30 worthwhile to determine its molecular basis. If an AmP deficiency underlies 
ACE inhibitor-induced angioedema, a pretreatment test for the deficiency 
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could spare some patients from life-threatening ACE inhibitor-induced 
angioedema. 

Using the AmP catalytic assay described herein and the knowledge 
that human plasma, platelets, lymphocytes and urine (all being readily 
accessible biopsy tissues) normally contain AmP catalytic activity 
(21,106,165,191,205), untreated hypertensive patients can be screened now 
for AmP deficiencies. Antibodies to human AmP and genetic probes can be 
produced. Thus, AmP deficiency states, protein-positive and protein- 
negative, and their bases at the molecular level can all be determined. 

Molecular structure and function 

Purification of aminopeptidase P . 

Three groups independently purified aminopeptidase P (AmP) to 
apparent homogeneity. As noted above, guinea pig serum is a rich source of 
soluble AmP, which can be purified to obtain two isoforms, Mr 89,000 and 
81,500 (22). On concanavalin-Sepharose chromatography, both isoforms 
were found to behave as a mixture of biantennary and high mannose 
glycoproteins (70%/30%). Turner and colleagues (1 1 1) purified pig kidney 
cortex AmP, after converting the amphipathic into the hydrophilic form with 
phosphatidyls sitol-specific phospholipase C (PI-PLC), to obtain an 
apparent single isoform, Mr 95,000, that was converted by treatment with N- 
glycosidase F into two isoforms, Mr 71,500 and 68,000. Simmons and 
Orawski (180) purified bovine lung AmP, solubilized with PI-PLC, which on 
SDS-PAGE migrated at Mr 95,000. All three purifications were laborious 
and required seven or more steps. The Turner protocol employed nine steps 
and provided apparently pure AmP in a 1% yield (111). Guinea pig, pig and 
bovine forms of AmP all behaved as if N-blocked on Edman degradation. 

To obtain a simpler means of purifying AmP, two mice were 
immunized with the biantennary form of guinea pig serum AmP Both mice 
produced high titer anti-AmP, which, on Western blotting, proved to be 
reactive with both AmP isoforms, Mr 89,000 and 81,500. The spleen of one 
mouse was used to produce hybridomas, twelve of which produced anti- 
AmP, all of the IgGi isotype. A hybridoma that produced anti-AmP with 
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anticatalytic effects on reaction with guinea pig serum, rat kidney and human 
serum AmPs, was selected. After double cloning, ascites monoclonal 
antibodies, known hereinafter as HL510, was produced. 

HL510 was used to prepare an immunoafFinity matrix (antibody 
5 bound to protein A-Sepharose and then crosslinked with a Afunctional active 
ester). The immunoaffinity matrix enabled isolated of homogeneous guinea 
pig AmP in the quantities needed for amino acid sequencing and was used to 
purify hydrophilic (post-PI-PLC treatment) forms of kidney and lung AmP 
as well as serum AmP. A 4 ml column of the matrix was used repeatedly to 
1 0 obtain a total of about 20 nmol of apparently pure AmP 

Others tried to purify soluble "cytosolic" forms of AmP. Harbeck 
and Mentlein (98) obtained highly-purified rat brain AmP, which behaved on 
molecular sieving (Mr 143,000) as if a dimer of the Mr 71,000 monomer 
found on SDS-PAGE under reducing conditions. Whether brain AmP is an 
1 5 unglycosylated alternative gene product related to gpi-anchored AmP is not 
yet clear. The profile of rat brain AmP in terms of selectivity of substrate 
hydrolysis and responses to inhibitors and other affectors is similar to those 
of kidney, lung and serum AmPs, and alternative splicing of the primary 
transcript may account for the apparent absence of gpi-anchoring. However, 
20 it has been reported that some strains of E. coli contain two AmP products 
and two separate genes (206), which may also be the case for human AmPs. 

Soluble forms of AmP have also been purified from human platelets 
(191) and leukocytes (165). Both AmPs migrate on SDS-PAGE at Mr 
71,000. On molecular sieving, platelet AmP behaves as a trimer (Mr 
25 223,000) and leukocyte AmP behaves as a dimer (Mr 140,000). No direct 
studies have been performed to clarify glycosylation, but human platelet 
AmP was not retained by a mixed concanavalin A/wheat germ lectin 
chromatography matrix. 
Sequencing . 

30 Guinea pig AmP behaved on Edman degradation as if N-blocked. 

LysC digests of both kidney and lung AmP were therefore prepared, and the 
peptide products separated on Tris-tricine gels (79,172). Partial digestion 
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conditions were used to generate relatively large fragments. Separated 
peptides were blotted to a Problott membrane (ABI), and three lung and four 
kidney AmP fragments were selected for sequencing. Where overlaps 
occurred (80 amino acid residues), lung and kidney AmPs were found to be 
5 identical in structure. 

The "BLAST" network (49,50) was used to look for possible 
similarities to known proteins. The search picked up a tentative match with 
human proline dipeptidase (PDP). A second search using the "BLOCKS" 
program (102) revealed that guinea pig AmP contains at least three of six 

10 highly conserved blocks of amino acid sequences that define a newly- 
recognized protein family called the proline peptidase family. Further details 
on how guinea pig kidney and lung AmPs line up with sequences of known 
members of the proline peptidase family (of which PDP is a member) were 
obtained using the program "I ALIGN" (77). Through the foregoing analysis, 

1 5 it was evident that guinea pig kidney and lung AmPs contained all of proline 
peptidase blocks C,E and F. 

The six conserved blocks in human prolidase (blocks A-F) are 
arranged alphabetically from the N-terminus. By comparison with human 
PDP, the order of sequenced fragments of guinea pig kidney and lung AmP 

20 was deduced. Importantly, the expected length of protein between the 

fragments could be estimated, keeping in mind that the number of residues 
between conserved regions in AmP are not the same as found for other 
members of the family (blocks E and F are fused in AmP but are separated 
by more than 20 amino acid residues in PDP) (83,84). This information 

25 reduced the number of PCR primers that one would need to test and provided 
clues for analyzing PCR data. Knowledge of the placement of blocks of 
conserved sequences also provided clear directions for the use of nested 
primers. 

The proline peptidase group is a small family of related proteins 
30 including E. coli aminopeptidase P II, E. coli proline dipeptidase and human 
proline dipeptidase (PDP; prolidase). All three of these proteins are 
classified as manganese metalloenzymes, primarily because they are 
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stimulated by Mn 2+ . In this regard, mammalian AmP is also stimulated by 
Mn 2+ in its reaction with some, but not all, substrates (22, 31, 32, 1 1 1, 152, 
180). Zinc, 0.2 mole, was reported to be present per E. coli AmP subumt as 
detected by atomic absorption spectrophotometry (206), and pig kidney AmP 
5 is reported to contain about 1 mole of Zn per mole of enzyme (108). Finding 
a match of guinea pig lung AmP with human PDP was intriguing because of 
their similarity in substrate selectivity. Proline dipeptidase cleaves imide 
bonds of dipeptides in which proline is C-terminal, whereas AmP acts as an 
aminoacylproline hydrolase (22,29,32,83,188). 
10 Matthews and colleagues solved the three-dimensional structure of E. 

coli methionine aminopeptidase (AMPM) by x-ray crystallography (59). 
They began a database search for sequence-relatives and found 12 other 
proteins with small blocks of fair sequence similarity. One sequence-relative 
was found to be P. putida creatinase (CREA), another protein whose three- 
1 5 dimensional structure is known. Although the primary sequence homology 
between AMPM and CREA is low, Matthews and coworkers found that each 
protein possessed a C-terminal domain disposed in a "pita bread" fold. 218 
C a atoms of each protein are superimposable to within 2.5 A. Further 
examination of the primary sequences of other sequence-relatives of AMPM 
20 (including AmPs of E. coli, S. lividans and M. tuberculosis) revealed, in each 
case, aaPPP sequences characteristic of "pita bread" folds. Of no less 
importance, binding sites for the catalytic divalent metal of AMPM were 
well-characterized and were known to be disposed on either side of a two-0- 
sheet cleft common to AMPM and CREA. Based on the homologous 
25 tertiary structural blocks of the AMPM "pita bread" family and their 

similarities to at least four of the conserved blocks of the proline peptidase 
family in combination with the sequence data, one could predict part of the 
tertiary structure of guinea pig AmP and identify at least four of the metal- 
binding amino acid residues of the catalytic site; all without knowing the 
30 complete amino acid sequence. Block C clearly is within a p-sheet of the 
catalytic crevice and contains two divalent metal ligands (later identified in 
human kidney AmP as D450 and D451; see below), and blocks E and F are 
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clearly part of an apposing 0-sheet and contain two more metal ligands 
(E555 and E569 in human AmP). 

Dr. Wolfram Schafer of the Max-Planck-Institut fiir Biochemie 
sequenced most of pig kidney AmP by Edman degradation and some mass 
5 spectrometry (196). Within the limits of the sequence data, guinea pig and 
pig AmP sequences are 93% identical and 98% highly homologous. 

The data on human AmP, with those of Matthews and colleagues (59) 
and Schafer and colleagues, makes evident that mammalian AmPs contain 
the six conserved blocks characteristic of the proline peptidase family and 

1 0 that all known members of the proline peptidase family in fact compose a 
subgroup of the AMPM/CREA family of proteins characterized (not by their 
functions but) by their "pita bread" tertiary conformations. Blocks A and B 
of proline peptidases are parts of exterior ct-helices and blocks C, D, E and F 
are parts of the two apposing p-sheets that contain the catalytic site. With 

1 5 the primary sequence of human kidney AmP (see SEQ ID NO:2), the 
catalytic metal binding sites could be assigned: block C, D450 and D461; 
block D, H520; block E, E555; and block F, E569. A putative proton shuttle, 
H430, could also be postulated. Each of the putative divalent metal-binding 
ligands and the putative proton shuttle is a reasonable target for preparing 

20 site-specific mutants. 

Human kidney AmP cDNA 

There are large interspecific differences in AmP abundance and 
distributions among organs (16,21). Using human kidney and lung poly A 
RNAs in reverse transcriptase polymerase chain reaction (RT-PCR) studies 
25 with degenerate guinea pig primers, five cDNA fragments whose nucleotide 
sequences enabled preparation of nondegenerate primers for human AmP 
cDNA were obtained. 

A sense primer based on QMDCNW (now known to be residues 124- 
129 of human AmP) was used with a reverse primer based on FQKEAY 
30 (residues 474-479) to obtain a 1068 bp fragment. Fragments from three 
separate PCR reactions were subcloned (TA Cloning Kit, Invitrogen) and 
sequenced. All three independent PCR products were found to have 
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identical sequences, ruling out PCR nucleotide-incorporation errors. The 
remaining 5' and 3' nucleotide sequences were obtained by RACE methods. 
5'-RACE was performed using both human kidney and lung poly A RNAs. 
PCR products were subcloned and sequenced. Kidney and lung cDNA 
5 sequences were identical for the N-terminal open reading frame plus 264 
bases of the 5 '-untranslated region. 3'-RACE was performed to obtain the C- 
terminal portion of AmP coding sequence plus a 1 145 base 3 '-untranslated 
region. Two independent reactions gave identical sequence results. 
Composite cDNA and amino acid sequences. 
10 The composite cDNA sequence is shown in SEQ ID NO: 1 The 

DNA sequence has an open reading frame of 2019 nucleotides. The deduced 
amino acid sequence (SEQ ID NO:2) comprises 673 residues with a 
calculated molecular weight of 75,490. Comparison of the human AmP 
amino acid sequence to that of the pig (reported by Turner, 1 13) shows 
1 5 evolutionary divergence with only 83% amino acid sequence identity 

between the two species (Figure 1). Five of six potential N-glycosylation 
sites found in the pig sequence at residues 34, 48, 64, 277, 290, and 294 are 
conserved in the human sequence at residues 35, 49, 65, 278, and 291. F.ve 
of six cysteine residues that are potentially involved in disulfide bond 
20 formation are also conserved. These are located in the human sequence at 
positions 36, 127, 294, 299, and 53 1 . By comparison of the human AmP 
amino acid sequence with that of £ coli methionine aminopeptidase (59), it 
is postulated that, for human AmP, H430 is the proton shuttle and D450, 
D461, H520, E555 and E569 are the catalytic metal ligands. Site-specific 
25 mutants can be used to test this and to determine placements of disulfide 
bonds. Potential N-glycosylation sites can be mutated to examine for effects 
on AmP solubility and stability. 

Because AmP is a GPI-anchored protein, it is expected that the 
mature protein can be derived from a nascent form containing N- and C- 
30 terminal signal peptides that are removed during processing in the 

endoplasmic reticulum. Based on the weight-matrix method of von Heijne, 
analysis of the pig sequence (113) suggests that the N-terminal cleavage site 
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is either Lys-24 or His-22. The most important sequence positions in the von 
Heijne method are those at -1 and -3. If Lys-24 represents the true cleavage 
site this would put Pro at the -1 position in the pig sequence which is unusual 
in eukaryotic signal sequences. Lys-24 and His-22 are both conserved in the 
5 human sequence as are the -1 and -3 positions relative to His-22 (Figure 1). 
The -3 position relative to Lys-24 is also conserved. The -1 position, 
however, contains a Thr residue rather than a Pro which is more commonly 
found in this position in eukaryotic signal sequences. Based on the cleavage 
prediction criteria developed by Udenfriend and Kodukula (61), Ala-649 has 

1 0 been predicted to be the C-terminal co-residue in the pig enzyme with Arg 
and Ala in the important ©+1 and co+2 positions, respectively (113). 
Identical ©, eo+1, and (0+2 residues are found in the human AmP enzyme 
(Figure 1). The exact anchorage site can be examined by mutation and by 
Edman degradation and mass-spectrometry of C-terminal peptides produced 

15 by GluC digestion. 

Genomic DNA Sequence of Human AmP. 
A search of GenBank using the human AmP cDNA sequence 
revealed a sequence, dJ753P9 (an unfinished human chromosome X genomic 
sequence from the Sanger Center group of the Human Genome project), 

20 containing human AmP sequences. A comparison of this clone with the 

AmP cDNA sequence revealed segments of the genomic sequence that were 
in the wrong orientation or relative position, or which were spurious. These 
errors would not have been readily apparent without comparison to the 
cDNA sequence. Using the cDNA sequence as a guide, the jumbled 

25 dJ753P9 sequence was rearranged to arrive at the genomic sequence of 
human AmP, including introns. A second sequence, dJ454M7 (a genomic 
sequence containing the oculocerebrorenal syndrome gene also from the 
Sanger Center group of the Human Genome project), overlapped the 
dJ753P9 sequence in the upstream region. 1 10,000 nucleotides of the 

30 (U454M7 sequence was combined with the rearranged dJ753P9 sequence to 
arrive at the disclosed human AmP genomic sequence. The sequence data of 
sequences dJ753P9 and dJ454M7 were produced by the X Chromosome 
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Sequencing Group at the Sanger Centre and can be obtained from 
ftp://ftp.sanger.ac.uk/pub/dJ753P9 and ftp://ftp.sanger.ac.uk/pub/dJ454M7, 

respectively. 

The assembled genomic sequence is shown in SEQ ID NOs:3, 4, 5, 6, 
5 and 7. SEQ ID NO:3 shows the first 50,000 nucleotides of the AmP 
genomic DNA (nucleotides 1 to 50,000). SEQ ID NO:4 shows the next 
50,000 nucleotides of the AmP genomic DNA (nucleotides 50,001 to 
100,000). SEQ ID NO:5 shows the next 44,453 nucleotides of the AmP 
genomic DNA (nucleotides 100,001 to 144,453). SEQ ID NO:6 shows the 
10 next 45,546 nucleotides of the AmP genomic DNA (nucleotides 144,454 to 
189,999). SEQ ID NO:7 shows the last 16,955 nucleotides of the AmP 
genomic DNA (nucleotides 190,000 to 206,954). SEQ ID NOs:3, 4, and 5 
represent sequences upstream of the AmP coding region. SEQ ID NO:6 
represents the AmP coding region (including introns) and some downstream 
15 sequences. SEQ ID NO: 7 represents sequences downstream of the AmP 
coding region. The location of introns in the AmP genomic DNA is shown 
in Table 2. The position refers to the nucleotide positions in SEQ ID NO:6. 
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Table 2: Location of introns in the AmP genomic DNA 

Intron Position 

(in SEQ ID NO:6) 

1 49-2893 

2 2969-4749 

3 4861-5990 

4 6054-7023 

5 7129-7382 

6 7470-8394 

7 8542-11255 

8 11361-12535 

9 12614-12936 

10 13135-13947 

11 14038-15260 

12 15372-16083 

13 16159-17270 

14 17346-19969 

15 20030-21300 

16 21370-21959 

17 22068-22796 

18 22854-23481 

19 23560-28390 

20 28415-28418 

21 28482-29079 



The coding region in the exonic sequences contain a total of 2019 
nucleotides, in perfect agreement with the coding region of human AmP 
cDNA. The cDNA sequence (SEQ ID NO:l) contains 264 nucleotides of 5' 
untranslated region, which starts at nucleotide 144,190 in the genomic 

5 sequence (nucleotide 44, 1 90 of SEQ ID NO:5). The 3' untranslated region 
starts at nucleotide 173,725 in the genomic sequence (nucleotide 29,272 of 
SEQ ID NO: 6). Regulatory sequences are present in the sequences upstream 
and downstream of the AmP coding sequence. The locations in AmP 
genomic DNA of restriction sites for rare-cutting restriction enzymes are 

10 shown in Table 3. The position refers to the nucleotide positions of the 
entire genomic sequence (1 to 206,954). 
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Table 3: 
Enzyme 

I-Ceul 

1-DmoI 

I-Ppol 

I-Scel 

PI-PspI 

Pl-Scel 

PI-Tlil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Sfil 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 

Xcml 



Locations in AmP genomic DNA of restriction sites 

Position Recognition 
sequence 



28417 GGCCCTCCTGGCC 

35327 GGCCTGGAAGGCC 

59892 GGCCGCCGCGGCC 

123855 GGCCTGAGAGGCC 

127512 GGCCAAGGTGGCC 

147456 GGCCCTTGTGGCC 

163911 GGCCTCAATGGCC 

173654 GGCCGCCAGGGCC 

174720 GGCCAAATTGGCC 

191056 GGCCCCATCGGCC 

199214 GGCCACAGAGGCC 

805 CCAAGCCCTCCATGG 

3268 CCAGACCCCTGCTGG 

9208 CCACTGAAGGCTTGG 

1 1273 CCAGATGTGTGGTGG 

13446 CCAGTCTAACTATGG 

20139 CCATGCCCCTCCTGG 

22210 CCAGGTGAGAGGTGG 
24186 CCAGATCTCTCCTGG 
30663 CCAAAGCAATCCTGG 
33277 CCAGCCCGGCCATGG 
34994 CCAGGCAATGGCTGG 
38816 CCAGTGGTCTTCTGG 
4 133 1 CCATGTCTCAATTGG 
43990 CCATTGTGGCTATGG 
44005 CCATGCCTAGTCTGG 
5 1 655 CCAAGGAATGGCTGG 
54873 CCAGGAGGGGGGTGG 
55199 CCAAGACAAGCCTGG 
56459 CCAGCCGGGCCCTGG 
57685 CCAAGGACAAAGTGG 
59638 CCAGCCGCCCCATGG 
62439 CCAATCCTTGATTGG 



21 



BNSDOCID <WO 9911799A2J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11799 



PCT/US98/18426 



Table 3 continued* 


Xcml 


VJ«y J J J 


Xcml 


646 IS 


Xcml 

./VWJ.JJLL 


68860 


XctnT 


601 7S 

U7 X / -J 


Xcml 


71841 

/ X OtJ 


XcmT 


749 SO 


Xcml 


89876 


Xcml 


84001 

OH77J 


XcmT 


8S461 


XcmT 


01011 


XcmT 


018S1 


XcmT 


1 01 910 


XcmT 


101 S77 

1U1J / / 


XcmT 


1 09 1 61 


XcmT 


1 04088 


XcmT 

/Vvllii 


1 OS 177 

1 UJ X / / 


XcmT 


1061 SI 


Xcml 


106489 


XcmT 


106S41 


Xcml 


106619 


Xcml 

■'Willi 


107121 

lv/ 1L1 


Xcml 


1 10156 


Xcml 

/VvllU 


1 10952 


XcmT 


1 1 211 2 

1 X £*D X 


XcmT 


190998 


XcmT 


1 9 1 774 

1 Z X / /*T 




194997 


XcmT 


1 90919 


XcmT 


1 10760 


XcmT 


moos 


YrmT 


1 1901 1 


XcmT 


119081 

X J^70 1 


XcmT 


111419 


XcmT 


1 11086 

1JJ70D 


Y rm T 


1 1S91 7 
1 jjZ X / 


XcmT 


1 56950 

X JUZJU 


XcmT 


158191 

X JO 1^1 


VrmT 




Xcml 


159043 


Xcml 


159777 


Xcml 


165124 


Xcml 


166087 


Xcml 


167088 



CCATAACAGCTATGG 

CCACGTCTCTTGTGG 

CCAGTTCCGTTATGG 

CCACAAACTTCGTGG 

CCACTGGTTTGGTGG 

CCACTTTTTGATTGG 

CCAGTATCTCAGTGG 

CCATGCCTGATCTGG 

CCAGGGGAGAAATGG 

CCAGGGTTGGTGTGG 

CCAATCACAGGGTGG 

CCATCATTTTCTTGG 

CCACCAACTGGGTGG 

CCAAGAAGCACCTGG 

CCACAAGGCTCTTGG 

CCATAGACTGGGTGG 

CCAGCCCCACTATGG 

CCAGGGGCTTGTTGG 

CC AGTGGAGGC CTGG 

CCAGTGCAAGAGTGG 

CCAAGGATGAGATGG 

CCAGCTCAGCCTTGG 

CCAACTGACCAGTGG 

CCATCTGTCTGCTGG 

CCAAGCACAGGATGG 

CCATTGGCCACTTGG 

CCATCCTCTCCCTGG 

CCAATTCTTTCTTGG 

CCATATGTCCCCTGG 

CCAAGCCACATCTGG 

CCAGCCAGCAATTGG 

CCAGCACCGACTTGG 

CCAGAGAGGGGCTGG 

CCACCCCATCTATGG 

CCAATGAGAACATGG 

CCAGGGACCCACTGG 

CCAGAGTGCTGGTGG 

CCAAATTATTCCTGG 

CCAATTCCTAACTGG 

CCAAAGGCACAGTGG 

CCACATCGCCTCTGG 

CCACAGCAATTATGG 

CCAGAGCCAATCTGG 
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Table 3 continued 


YrniT 
ACUU 


169063 


VrmT 


173427 


YrmT 


174118 




178624 




178990 


VrmT 


182319 


YrmT 
A.CIIU 


1 82870 


Acini 


183061 




184682 


A.CITL1 


185891 


YrmT 


185967 


Yr»ml 


185992 


Acnu 


186440 


Xcml 


188286 


Xcml 


193275 


Xcml 


195033 


Xcml 


199546 


Xcml 


204870 



CCATAAACAACATGG 
CCATCTGGACTATGG 
CCAAGGGTGCCATGG 
CCAGGCCGGGCATGG 
CCAAGGCCTTCCTGG 
CCAGCAAGGACCTGG 
CCAAAGGCCCGATGG 
CCAAAGAATGTATGG 
CCATAGTGACAATGG 
CCACTTTGGCCATGG 
CCAACCTGGAGATGG 
CCATTCCAGTCTTGG 
CCAGGTGCCCTATGG 
CCACTTTCTCCATGG 
CCAGCTCCCCCGTGG 
CCACTGAGGCAGTGG 
CCAAACTGACCATGG 
CCAACTTGACTGTGG 

Tissue Distribution Determined by Northern blots. 

The expression of membrane-bound AmP mRN A in human tissues 
was examined by Northern hybridization analysis of poly (A) + RNA 
(Clontech) using the 1068 bp human AmP cDNA fragment. A single 3.5 kb 
5 message was detected in human kidney, lung, heart, placenta, liver, small 
intestine, and colon. No transcript was detected in Northern analysis of poly 
(A) + RNA from human brain, skeletal muscle, pancreas, spleen, thymus, 
prostate, testis, ovary, and peripheral blood leukocytes. Possibly AmP RNA 
is in low abundance in the latter tissues, which can be determined by RT- 
1 0 PCR studies. The relationships between membrane-bound and soluble forms 
of AmP are unknown, but it may be relevant that heart poly A RNA gave a 
strong signal on Northern blotting. According to Simmons and collaborators 
(152), heart contains AmP in a soluble form. 
Anatomic determinants of function. 
1 5 Because AmP is not uniformly distributed among tissues and is 

apparently disposed as an ectoenzyme on some cell-types and as an 
intracellular enzyme in other cell-types, its roles in systemic biochemistry 
must be determined in part by its cellular and subcellular dispositions, 
distributions that restrict access to substrates and set reaction conditions. 
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Reactions of AmP in vivo. 

To gain further insight into functions of AmP in systemic 
biochemistry, studies were conducted to determine the physiologically- 
relevant question: Is BK in central venous blood hydrolyzed by pulmonary 

5 endothelial AmP in vivo? The immediate metabolic fate of the AmP 

synthetic substrate Arg-Pro-Pro-[ 3 H]benzylamide (APPBz- 3 H) (20 Ci/mmol) 
during a single transit from the right heart to the left was examined. Effects 
of increasing quantities of carrier APPBz and alternative AmP substrates 
such as bradykinin (BK) and des-Arg 9 -BK (3 1) were then measured. It was 

1 0 found that tracer doses of APPBz- 3 H are extensively hydrolyzed (mean 
hydrolysis of about 55% during a 2-3 sec mean transit time) and that the 
metabolic process is saturable (carrier APPBz injected at 42 nmol/kg b.w. 
reduced fractional hydrolysis of coinjected APPBz- 3 H by half). Using 
isolated rat lungs perfused with Krebs-Henseleit solution containing 

1 5 albumin, 4 g %, it was found that APPBz- 3 H is still extensively hydrolyzed, 
a result to be expected if AmP is largely disposed on the pulmonary vascular 
surface (5,6, 1 2, 14,3 1 , 1 67). It was also found that carrier APPBz at 
2 |imol/kg completely inhibited hydrolysis of coinjected tracer substrate and 
can thus be used as a short-acting AmP inhibitor. As implied by the 

20 saturable characteristics of APPBz- 3 H hydrolysis, alternative substrates for 
AmP should, in saturating doses, also inhibit APPBz- 3 H hydrolysis. In fact, 
BK proved to be an alternative substrate of even higher affinity than carrier 
APPBz; coinjected BK, at 13 nmol/kg, reduced APPBz- 3 H hydrolysis by 
half. Des-Arg 9 -BK was an alternative substrate of lesser affinity; ED50 of 

25 107 nmol/kg. Des-Arg^BK is apparently not a substrate for AmP but binds 
to the catalytic site nonetheless (21,22,3 1); thus, des-Arg^BK coinjected 
with APPBz- 3 H was expected to reduce hydrolysis of the tracer, an 
expectation met experimentally (ED50 of 30 nmol/kg). 
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Potentiation of effe rt* of hradvkinin (BK). 
If BK inhibits hydrolysis of APPBz- 3 H, BK hydrolysis by AmP 
should be inhibited by APPBz; a possibility tested as follows: Log dose- 
response curves were constructed by measuring the mean systemic arterial 
5 blood pressure effects of BK injected into the superior vena cava (i.v.) or the 
root of the aorta (La.). As shown previously (1,2,161), BK is extensively 
degraded during passage through the rat pulmonary vascular bed. Thus, the 
i.v. dose of BK required to reduce arterial blood pressure by, say, 25 mm Hg 
is 40 or more times the i.a. dose of BK required to exert equivalent effects. 
10 To the extent that pulmonary AmP contributes to BK inactivation, saturation 
of AmP with an inhibitor or alternative substrate should, in effect, potentiate 
blood pressure effects of i.v. BK. It was found that either of carrier APPBz 
or des-Arg 9 -BK potentiated blood pressure effects of i.v. BK by up to 4-fold. 
Effects of i.a. BK were also potentiated, a result that suggests that AmP is 
1 5 disposed on both pulmonary and extra-pulmonary vascular surfaces. From 
the relationship PF=2", where PF is "potentiating factor" and n is the number 
of biological half-lifes, it can be computed that a 4-fold potentiation of i.v. 
BK effects on blood pressure effects bespeaks the ability of pulmonary AmP 
to degrade BK through two half-lifes in a time interval of less than three sec 
20 (mean pulmonary transit time). Thus, AmP alone can degrade BK by 75%. 
Pulmonary angiotensin converting enzyme (ACE) is a major 
contributor to BK inactivation (1 19,120,167). Inhibition of ACE potentiates 
blood pressure effects of i.v. BK by 40- to 200-fold. The four-fold 
potentiation of i.v. BK effects achieved by inhibition of AmP is less 
25 spectacular but, as discussed below, important nonetheless. To clarify 
relative contributions of ACE and AmP to the metabolic fate of BK 
administered i.v., blood pressure effects of i.v. BK under control conditions 
and then after administration of a long-acting ACE inhibitor, RAC-X-65, 
were compared. After ACE inhibition, BK was injected (i.v. and i.a.) alone 
30 or BK co-mixed with APPBz at a dose capable of saturating AmP 

(2 umol/kg). ACE inhibition shifted both i.v. and i.a. log dose-response 
curves leftward. The i.v. curves were most affected but still lay to the right 
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of the i.a. curves by a factor of about four. Inhibition of both ACE and AmP 
caused the i.v. and i.a. BK log dose-response curves to become 
superimposable. The latter result appears to mean that, in rat lungs, ACE 
and AmP account entirely for the pulmonary metabolism of BK. When both 
AmP and ACE are inhibited, effects of i.v. BK are potentiated by up to 
800-fold. In one experiment of this series, 2.5 ng of i.v. BK (about 2 pmol) 
reduced systemic arterial blood pressure by 20 mm Hg; a finding that gives 
new emphasis to the importance of pulmonary AmP and ACE in preventing 
the entry of BK into the systemic circulation under physiologic conditions. 
Clinical implications. 

Precisely how BK is inactivated in humans is a matter of clinical 
importance. BK is an edematogenic compound capable of inducing urticaria 
when administered i.v., and has been postulated to play a role in induction of 
angioedema (99). Human lungs contain ACE, which is distributed so as to 
have access to circulating substrates such as BK and angiotensin I (66). 
However, there are now several million patients under treatment with ACE 
inhibitors and who therefore lack the ability to inactivate BK via the ACE 
pathway. Available data suggest that BK does not normally accumulate to 
any great extent in the blood of patients treated with ACE inhibitors 
(157,177), which in turn suggests that there is a backup, or supplemental, 
system for BK inactivation. Whether AmP accounts (or accounts for a 
significant fraction) for BK inactivation in patients treated with ACE 
inhibitors is not yet known, but the possibility is worthy of consideration in 
terms of adverse effects of ACE inhibitors. An infrequent, but potentially 
fatal, adverse effect of ACE inhibition is angioedema, a complication that 
may be due to the lack in some patients of a non-ACE BK-inactivating 
system, possibly AmP. It is likely that some subjects lack AmP Blau et al 
(62) have reported that an intestinal biopsy sample from a 15 -year-old male 
contained normal saccharase activity but no measurable AmP activity (less 
than 2% of the mean of control values). Plausibly, subjects lacking AmP 
activity are at risk for ACE inhibitor-induced angioedema. By analogy, a 
relative lack of AmP could be associated with other, more frequent, ACE 
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inhibitor-related side effects such as cough and pemphigus-like skin 
eruptions (3 1,99). 

Immunocytochemistry. 

Functions of AmP are likely to be determined in part by its anatomic 
5 dispositions which are evident by immunocytochemistry performed at the 
level of light microscopy. A description of these studies follows. 
Antihodv specificity . 

HL510 and the polyclonal anti-AmP have been used in the 
immunocytochemistry studies described below. Although the epitope bound 
10 by HL5 1 0 is not yet known, all data support its specificity. HL5 1 0 binds to 
guinea pig plasma, lung and kidney AmP isoforms and works well for 
immunoprecipitations, western blots and immunocytochemical studies. 
HL5 10 binds rat forms of AmP; thus, parallel studies of the disposition of 
AmP in guinea pig and rat tissues were conducted. HL510 has anticatalytic 
1 5 effects on human plasma AmP and has proved to be useful for purifying 

human lung AmP. HL510 reacts specifically with human pulmonary artery, 
lung microvascular and aortic endothelial cells as evidenced by indirect 
immunofluorence and immunoprecipitation studies. HL510 immunoaffinity 
matrix binds the two isoforms (Mr 89,000 and 76,000) of PI-PLC solubilized 
20 guinea pig kidney AmP quantitatively. Prolidase, a proline peptidase family 
relative of MW 56,000 (83), is not bound by the immunoaffinity matrix nor 
by HL510 alone as evidenced by the fact that the two kidney AmP isoforms 
were obtained in homogeneous forms on immunoaffinity chromatography 
(32) and the fact that a Mr 56,000 protein has not been found in AmP 
25 preparations collected by immunoprecipitation. Cross-reacting contaminants 
having Mr' s like those of the AmP isoforms can also be ruled out. Eight 
peptides produced by LysC digestions of immunoaffinity-purified guinea pig 
lung and kidney AmPs have been sequenced. Each of the eight peptides 
sequenced as if pure (no secondary sequence signals). All of the sequences 
30 aligned with high similarity with the pig kidney AmP sequence. These 
results, then, are consistent with the conclusion that HL510 is specific for 
AmP. 
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Light microscopy . 

Based on the early studies of pulmonary angiotensin converting 
enzyme (ACE) (5,6), glutaraldehyde fixatives were used, and the light 
microscopy studies using frozen sections and sections of tissues were fixed 
5 in picric acid/paraformaldehyde. The latter fixative is adequate for electron 
microscope (EM) immunocytochemistry at moderately high resolution (5,6). 
Thus, the data should be directly applicable to EM immunocytochemical 
studies. In addition, Vector ABC kit reagents were used throughout (second 
antibody bridged via a biotin:avidin complex to horseradish peroxidase), and 

1 0 these too can be used for some EM studies. 

As shown in micrographs, the entire alveolar-capillary unit of guinea 
pig lung appears to react with anti-AmP. This is a picture essentially 
identical to that obtained in light microscope immunocytochemistry studies 
of pulmonary ACE (69), a target known to be disposed almost exclusively on 

1 5 the luminal surface of pulmonary endothelial cells (5,6). Resolution at the 
EM level will therefore be required to determine precise cellular and 
subcellular dispositions of AmP. It is very likely relevant, however, that the 
AmP immunoreactivity of the arteriole shown in the left upper quadrant of a 
crograph is restricted to the endothelial layer. Initially, it was believed that 

20 AmP would co-localize with ACE, given the facts that 1 ) ACE is disposed 
on endothelium and 2) both enzymes have access to peptide substrates 
injected i.v. (3 1,167). Results obtained thus far are consistent with 
localization of AmP on endothelium. AmP immunoreactivity in association 
with some airway epithelial cells and mononuclear leukocytes, cell-types that 

25 are not reactive with anti-ACE, has been found. To distinguish similarities 
and differences in the cellular dispositions of AmP and ACE, EM studies are 
planned in which tissue is examined using both mouse anti-AmP and rabbit 
anti-ACE. AmP, unlike ACE, is believed to participate in the metabolism of 
collagen fragments formed by collagenase (205), thus whether AmP is 

30 disposed in, or near, intercellular matrix can be determined. 

Micrographs show that AmP immunoreactivity is also disposed on 
guinea pig renal proximal tubule, jejunum enterocytes and villus vascular 
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cores and in association with microvessels of the endocrine and exocrine 
pancreas. The kidney micrograph illustrates that cells of the glomerulus, 
including glomerular endothelium, lack AmP immunoreactivity. Previous 
studies of ACE were similar: glomerular endothelium was unique among 
5 endothelia studied in that it failed to react with anti-ACE (69). Endothelium 
of small arteries of the renal cortex react with anti- AmP, but by far the 
greatest AmP immunoreactivity is that of the proximal tubules. 

Immunocytochemistry studies of rat tissues were, with one exception 
(see below), consistent with the studies of guinea pig tissues. Pulmonary 
1 o alveolar-capillary units were heavily stained, as were airway epithelial cells. 
Similarly, small intestine enterocytes and kidney proximal tubules were 
heavily stained, and glomerular cells were apparently free of AmP 
immunoreactivity. However, rat spleen contained few, if any, sites of AmP 
immunoreactivity, whereas the red pulp of guinea pig spleen (and 
1 5 microvessels of red and white pulp) were heavily stained. In guinea pig 
spleen red pulp, cells on sinusoid walls were heavily stained, as were 
mononuclear leukocytes. Some human lymphocytes contain a 71,000 Mr 
AmP in soluble form (165), as do rat cerebral astrocytes (147). The results 
indicate that some guinea pig leukocytes possess AmP immunoreactivity. If 
20 the guinea pig leukocyte AmP immunoreactivity represents the soluble 

71 ,000 Mr isoform, one can purify it using the immunoaffmity columns now 
on hand. Gumea pig and rat tissue homogenates were assessed for AmP 
catalytic activity (21,22). Rat kidney, lung and jejunum contained AmP at 
the highest specific activities. For guinea pig, the highest specific activities 
25 were found in homogenates of spleen, kidney, liver, and jejunum. The 
immunocytochemistry results are in accord with the biochemical surveys. 

Immunofluorescence photographs show the reaction of HL510 with 
human pulmonary microvascular endothelial cells in culture. Ref. 34 
contains an immunofluorescence micrograph of the reaction of HL510 with 
30 human aorta endothelial cells. Antibodies to AmP can be prepared by using 
as immunogens unique peptide sequences of human AmP predicted to be 
antigenic (EGCG program). Further to favor precise localizations, 
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antibodies labeled with colloidal gold particles can be used. As a gpi- 
anchored enzyme, membrane-bound AmP is expected to be localized in 
caveolae (137,184). Indeed, the distribution of immunofluorescent spots in 
micrographs is typical of antigens situated in caveolae (90,91). 
5 AmP and its nearest-neighbors. 

Increasingly, it appears that cell membrane receptor/cell signaling 
reaction cascades depend in part on close anatomic proximity of all or many 
of the relevant reactants. Extensive biochemical data have revealed close 
associations between receptors, signaling molecules and caveolins 

10 (74,89,92,130,138,160,173,175,183). Complementary morphologic data are 
lacking. Specifically, subcellular dispositions of endothelial AmP in respect 
to effectors and signaling molecules whose functions may directly or 
indirectly be influenced by AmP catalytic activity can be determined. AmP 
in respect to the bradykinin (BK) B2 receptor, eNOS and guanylate cyclase 

1 5 can be determined. 

As a gpi-anchored enzyme, it is anticipated that AmP is disposed 
within endothelial caveolae (3,4,1 1,74,90,137,173). Some have reported that 
localization of gpi-anchored proteins within caveolae is an artifact 
attributable to crosslinking of antigen: antibody complexes by second 

20 antibodies ( 1 43 , 1 45). However, cytochemical techniques without 

crosslinking agents have shown that endothelial 5 '-nucleotidase, now known 
to be gpi-anchored, is disposed almost exclusively within caveolae (3,4). 
Further, coupled functions of cell membrane receptors with cell signaling 
proteins known to be disposed on the cytoplasmic aspect of caveolae argue 

25 in favor of anatomic proximity (9-1 1,63,89, 127,128,164). 

Given the high likelihood that AmP functions in part as a BK 
inactivating enzyme (3 1) and that BK in near-physiologic concentrations 
(-10 pM) exerts effects on endothelial cells (e.g. mobilization of 
arachidonate and synthesis of TxA 2 and PGI 2 ) (7-1 1), endothelial cells have 

30 been used in culture to examine morphologically for functionally-significant 
anatomic proximities of AmP with the BK B2 receptor and well- 
characterized BK-activated signaling molecules known to be associated with 
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caveoiae (90,91,131,173). Atwo antigen immunocytochemistry approach 
can be used. This should reveal aspects of AmP functions that are dependent 
not only on subcellular disposition but also on functionally-related proximate 
proteins. 

5 AmP in human pulmonary microvascular endothelial cells. 

Cultures were fixed with 4% formaldehyde, permeabilized with 0.2% 
Triton X-100 in BSA/PBS, incubated with primary antibody (1:100 dilution) 
(monoclonal anti-GP-AmP, overnight at 4°C) followed by secondary 
antibody (FITC-labeled goat anti-mouse IgG, 1 hour at 22°C) and then 
10 analyzed and photographed using a Biorad confocal microscope. 
Transient expression of AmP in COS-1 cells. 
The full length 3.5kb cDNA encoding human kidney AmP can be 
inserted into the expression vector pBKCMV as described for pig kidney 
AmP cDNA (113). The orientation of the insert can be verified by a 
1 5 directional PCR reaction, and the correct construct can be used to transfect 
COS-1 cells. About 2E + 06 cells are plated in 1 50 cm 2 culture flasks and 
allowed to proliferate for 24h at 37°C (113). Cells are then washed with 
Optl-Mem and transfected (5 ug of DNA/flask) using lipofectAmine. After 
2h at 37°C, Dulbecco's modified Eagle's medium containing 10% FCS is 
20 added. Twenty four h later, the medium is replaced with fresh, and the cells 
will be incubated at 37°C for another 24h. 

Parallel control cultures, transfected with vector lacking the AmP 
cDNA insert, can be processed similarly. A small portion of control and test 
cells are harvested and examined by indirect immunofluorescence using our 
25 monoclonal anti-AmP HL510 as the primary antibody. A second portion of 
each of the control and test cells are washed free of culture medium and then 
resuspended in 50 mM Hepes/NaOH buffer, pH 7.4, containing 0.15 M NaCl 
(assay buffer). The AmP substrate Arg-Pro-Pro-[ 3 H]benzylamide (21) is 
added to the cell suspension to a final concentration of 20 nM (1 uCi/ml). 
30 The cell/substrate reaction mixture is incubated at 37°C, and aliquots is 

collected at timed intervals for measurement of the rate of formation of the 
expected product, Pro-Pro-^benzylamide (21,22). AmP activity can be 
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computed to yield the first order rate constant, Vmax/Km. COS-1 control 
cells (transfected with vector lacking the AmP cDNA insert) do not express 
AmP (113); thus we expect to be able to detect even low levels of expression 
of human kidney AmP. 
5 The bulk of the COS-1 cells can be worked up to prepare cell 

membranes. The cells, in assay buffer containing 10 |4g/ml of each of 
pepstatin, leupeptin and aprotinin, are homogenized, and the homogenate is 
subjected to differential centrifugation (32, 1 10, 1 1 1). The cell membrane- 
enriched fraction is assayed for AmP catalytic activity (see above), and then 

1 0 solubilized with 60 mM octyl glucoside (76). Half of the resulting mixture is 
treated with phosphatidylinositol-specific phospholipase C (recombinant PI- 
PLC from B. thuringiensis) before phase separation with Triton X-l 14, and 
the remaining half is directly phase separated. If, as expected, the expressed 
AmP possesses a glycosyl phosphatidylinositol (gpi) lipid anchor, PI-PLC 

1 5 treatment should convert the amphipathic form (partitioned into the Triton 
phase) into the hydrophilic form. Both amphipathic and hydrophilic forms 
are subjected to SDS/PAGE under reducing conditions (10% gel). The 
proteins will be transferred on to Immobilon P for western blot analysis 
using anti-AmP HL510. Expressed monomeric AmP is expected to have an 

20 Mr near 90,000 (22,32, 1 1 1 , 1 52, 1 80). 

Overexpression of AmP. 

The baculovirus/Sf9 insect cell system, a system known to be capable 
of expressing biologically functional, glycosylated gpi-anchored proteins, 
will be used to obtain human kidney AmP in milligram quantities. 

25 Recombinant human cluster of differentiation antigen CD59 has been thus 
obtained in milligram quantities, with not less than 98% of the product 
bearing the gpi anchor, as judged by Triton X-l 14 phase partitioning before 
and after treatment with PI-PLC (76). Approximately half of CD59 was 
anchored to the cell membrane, and the remainder was secreted into culture 

30 medium. The secreted CD59 was in amphipathic form and could be 
converted by PI-PLC into the hydrophilic form. CD59 was produced in 
three isoforms, all with the expected N-terminal amino acid sequence and all 
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bearing a gpi anchor. Apparently, the glycosylation process was 
overwhelmed by high protein expression such that the two smaller isoforms 
were inefficiently glycosylated. The ability of SO cells to N-glycosylate 
recombinant proteins at expected sites is well-recognized; however, the 
5 glycosyl groups are generally of the high mannose type (76). The efficiency 
of glycosylation improves with increasing time of culture, thus it may be 
useful to analyze samples, and harvest and replace if indicated, culture 
medium daily so as to collect separately secreted AmP isoforms that differ in 
terms of numbers and possibly types of glycosyl groups. As for glycosyl 
1 0 groups, the types of anchors attached to recombinant proteins are 

characteristic of Sf9 cells and can differ in structure from gpi anchors 
attached by, e.g., human kidney proximal tubule epithelial cells (76). 
Nonetheless, Sf9 cell-produced proteins have the expected full-length 
peptide, correctly folded and crosslinked by disulfide bonds. Recombinant 
1 5 enzymes thus produced are typically fully active (76, 1 94, 1 95). 
Overexpression of wild-type human AmP 

The cDNA sequence encoding human AmP can be subcloned into the 
polyhedrin-based plasmid transfer vector pVL1393 (Pharmingen). 
Recombinant transfer vector can then be cotransfected with Baculogold 
20 (Pharmingen) viral DNA into Sf9 insect cells (2 x 10 6 cells in monolayer). 
Six days after cotransfection, the cells can be harvested and expression of 
AmP examined by assay of catalytic activity, immunofluorescence and 
western blotting. Conditioned medium containing recombinant virus is used 
to reinfect Sf9 cells through 2-3 rounds of amplification to obtain a high titer 
25 virus stock (1E+08 virus particles/ml). Optimal conditions of multiplicity of 
infection and length of infection can be defined. Maximal expression is 
typically obtained after 3-4 days of infection, at which time conditioned 
medium can be harvested and worked up in parallel with the Sf9 cells for 
their contents of recombinant human AmP. Samples of conditioned medium 
30 can be collected at timed intervals before final harvest in order to monitor 
efficiency of glycosylation. N-glycosylation of AmP early in culture is 
expected to be relatively inefficient and may provide useful insights if 
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multiple isoforms are obtained at final harvest. Triton X-l 14 phase 
extractions can be performed to examine for the efficiency of gpi-anchor 
attachments. As for recombinant human CDS 9, it is expected that 
conditioned medium will contain substantial quantities of recombinant AmP 
5 in its amphipathic form (76). 

Purification of wild-type human AmP can be based primarily on the 
immunoaffinity procedure that we have described previously (32). However, 
two early group separation procedures may simplify purification and 
improve yields. In the first step, amphipathic protein is selected for by 

10 Triton X-l 14 phase separation. The Triton phase is collected, diluted and 
then treated with PI-PLC. In a second step, the PI-PLC-formed hydrophilic 
protein, expected to be N-glycosylated with high mannose side functions 
(76), is isolated on concanavalin A-Sepharose. AmP is eluted. The 
immunoaffinity purification step can then be performed using relatively low 

1 5 protein loads. The goal is to obtain at least 20 nmol (about 2 mg) of pure 
AmP per 150 cm 2 culture flask. The high titer virus stock produced as 
described above can be used to scale up production of recombinant AmP as 
needed. All of the following studies of wild-type AmP can be performed 
using less than 50 nmol of the pure protein. 

20 Characterization of wild-type AmP. 

Kinetics. 

Using Arg-Pro-Pro-[ 3 H]benzylamide as substrate, kcat, Km and 
kcat/Km can be measured as described in the studies of guinea pig serum 
AmP (22). Pure recombinant wild-type human AmP is expected to have a 

25 second order rate constant, kcat/Km, on the order of 1.8E + 08 M" 1 min 1 

(22). In addition, kinetics of the reaction of AmP with bradykinin, Arg-Pro- 
Pro and Gly-Pro-Hyp can be characterized (22,1 1 1, 180). pH optimum and 
pH stability studies can be performed using Mes, Hepes and phosphate 
buffers. Recombinant AmP can be examined for expected responses to 

30 effectors such as Mn 2+ , EDTA, o-phenanthroline, p- 

hydroxymercuribenzoate, and dithiothreitol (22,1 1 1,1 13,152,180). 
Recombinant AmP can also be tested for thermal stability (152,180), not 
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only for comparison against naturally-occurring AmP, but also to set a 
baseline for characterizing potentially unstable mutants that lack disulfide 
bonds, glycosylation sites or metal ligands. 
Chemical properties. 
5 Incorrect estimations of the molar extinction coefficient of 

angiotensin converting enzyme (ACE) caused confusion for more than a 
decade, especially in terms of determination of the number of atoms of zinc 
per molecule of ACE and the specific activity of the pure enzyme (see 27 
and its references). To avoid such confusion for AmP, UV spectra (210-340 
1 0 nm) can be developed using three concentrations of wild-type AmP (optical 
densities of about 0.2, 0.5 and 1.0 at 280 nm). To enable accurate 
computation of AmP concentrations, a sample of each AmP preparation thus 
tested can be submitted to quantitative amino acid analysis. Special focus 
can be placed on histidine, which is expected to be recovered in a mole ratio 
1 5 (His/ AmP) of 1 2. When the molar extinction coefficient is established, it can 
be used to calibrate protein assay results obtainable by conventional Lowry, 
BCA and dye-binding methodologies. 

Recombinant AmP (1 nmol in a 1 mm light path cell) can also be 
characterized by circular dichroism. Spectra can be recorded at 13°C using a 
20 AVIV-60DS spectropolarimeter, and, with buffer baseline corrections, 
relative percentages of a-helix, (3-sheet, f3-turn and random coil structures 
can be estimated using AVIV software. The major purpose of these studies 
is to establish a basis for detecting variations in higher structure of unstable 
or catalytically-inactive mutants. 
25 Recombinant AmP can be analyzed by MALDI-TOF mass 

spectrometry to weigh the parent molecule and any dimer or trimer forms, 
and examine for characteristic fragmentation patterns that may later be useful 
for analyzing mutants (36,196). O-glycosidase can be used to rule in or out 
the presence of O-linked carbohydrate (196). The following text assumes 
3 0 that AmP does not contain O-linked carbohydrate, and the approach will 
require adjustments along obvious lines if the assumption is incorrect. At 
present, O-glycosylation seems unlikely in that exhaustive treatment of pig 
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AmP (PI-PLC solubilized) with N-glycosidase F yields a peptide of Mr 
71,000, essentially as expected for a 626 residue peptide plus a gpi-anchor 
remnant (196). 

Human kidney AmP contains a single Asp-Pro bond (D157-P158) 
5 (see SEQ ID NO:2) that is expected to hydro lyze spontaneously under the 
acid conditions required to form CNBr or BNPS skatole fragments (75,126). 
Since its spontaneous hydrolysis could complicate early efforts to interpret 
peptide fragment fingerprints, hydrolysis of AmP at D157-P158 should be 
attempted before beginning conventional fingerprinting. As described 

10 below, several analytical advantages accrue if the D-P bond can be 
hydrolyzed efficiently. 

Recombinant AmP, 0.1 nmol initially, is dissolved in 1 ml of 7M 
guanidinium chloride in 10% acetic acid adjusted to pH 2.5 with pyridine 
(126). The mixture is incubated at 37°C for up to 96h. At timed intervals, 

1 5 samples are examined by mass spectrometry and N-sequenced; the latter to 
monitor the rate of appearance of the new N-terminus, PFLL (residues 158- 
161). For the following, it is assumed for convenient discussion that K24 
(probably acylated) is the first residue of mature AmP and that A649 is the 
last. Elsewhere, these assumptions can be tested. The expected two pieces 

20 (N-piece, residues 24-157; and C-piece, residues 158-649) should be readily 
separated on Sephadex G-50. If reduction is required for separation of the 
N- and C-pieces, this will be evidence for the presence of a disulfide bond. 
The N-piece contains three potential N-glycosylation sites, N35, N49 and 
N65, and two Cys residues, C36 and C127. The N-piece is expected to have 

25 an N-oc acyl modification (22, 1 1 1, 180). If, in fact, the N-piece is resistant to 
Edman degradation, it can be digested with AspN to obtain a 43 amino acid 
(a. a.) residue peptide which contains C36 and potential glycosylation sites 
N35, N49 and N65. If, as predicted, K24 is the N-terminal residue of 
mature AmP, an acylated (possibly diacylated) tripeptide is expected, and its 

30 mass should reveal the identity of the acyl-function (125, 1 54, 1 89). If one or 
more of its glycosylation sites is glycosylated, the 43 residue peptide can be 
separated from the remainder of the AspN digest using con A-Sepharose (see 
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above). If C36 and C127 are linked by a disulfide, reduction of the high 
mannose fraction eluted from con A-Sepharose should yield a second peptide 
(residues 1 14-1 57), which can be identified by mass spectrometry. Mass 
spectrometry of the 43 a.a. residue peptide should also suggest, in terms of 
5 actual mass versus expected mass, whether one, two or all three of the 

potential N-glycosylation sites are glycosylated in fact. Edman degradation 
should make clear whether N35 and/or N49 are glycosylated. Given its 
distance from the AspN-generated N-terminus, N65 can be made more 
effectively accessible to Edman sequencing by cleavage of the M61-Q62 
10 bond with CNBr (41,48,49,75). The expected peptide, Q62-T74, can be 
weighed by mass spectrometry to determine whether N65 is or is not 
glycosylated (36). As is needed to isolate peptides of special interest (e.g. 
the AspN-generated N-terminal acyl-tripeptide) that cannot be collected on 
con A-Sepharose, reverse phase HPLC (Brownlee, aquapore 300) with a 
1 5 morpholine phosphate buffer, pH 6.5, as the mobile phase can be used (103). 
This system provides high resolution under conditions unlikely to damage 
the expected gpi-tail piece and unlikely to hydrolyze peptide bonds 
artifactually. 

Analysis of the potential N-glycosylation sites, and possible disulfide 
20 bonds, of the acid hydrolysis-produced C-piece (P158-A625) can proceed 
similarly. Exhaustive digestion with GluC is expected to yield two peptides 
containing potential N-glycosylation sites, a 41 a.a. residue peptide 
containing N278 (peptide T245-E285) and an 18 residue peptide (T286- 
E303) containing N291 and two Cys residues, C294 and C299. If 
25 glycosylated, both peptides should be susceptible to isolation from the GluC 
digest on con A-Sepharose. Since high mannose glycosyl groups are 
expected from a baculovirus/Sf9 expression system, failure of one or both 
peptides to bind to con A-Sepharose is presumptive evidence of the absence 
ofanN-glycosyl sidechain (26). If both T245-E285 andT286-E303 are 
30 isolated on con A-Sepharose, mass spectrometry can be used to verify that 
each of N278 and N291 is glycosylated. Edman degradation of T286-E303 
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may reveal whether C294 and C299 are, or are not, linked by a disulfide 
bond. 

GluC digestion of the C-piece (P158-A649) is expected to generate a 
relatively small C-terminal peptide, the last residue of which, in native AmP, 
is attached, via ethanolamine, to the gpi anchor (86,139-141,144,159,162). 
If GluC can hydrolyze an E-P bond, the C-terminal peptide is expected to be 
PLAA. If not, the GluC-generated C-terminal peptide is expected to be 
W639-A649. It should be possible to collect either peptide by 
immunoprecipitation. The PI-PLC-generated hydrophilic form of AmP is 
known to possess a C-terminal common recognition determinant (CRD) 
(29,32, 111,113,180). PI-PLC cleaves the phosphodiester bond between 
inositol and the diacylglycerol, forming a 1,2-cyclic phosphate ring on the 
inositol residue. The cyclic inositol phosphate is highly immunogenic, and 
antibodies prepared against any PI-PLC-solubilized protein cross-react with 
this epitope (the cross-reacting determinant, CRD) (86,139,140,208). We 
have one such antiserum (prepared against trypanosome variant surface 
glycoprotein) and have shown that it recognizes the CRD of PI-PLC- 
solubilized guinea pig kidney AmP and with an AmP peptide generated by 
LysC digestion (29,32). 

With the anti-CRD, one can isolate the C-terminal peptide of AmP 
(expected to be PLAA-CRD or W639-A649-CRD) from the above-described 
GluC digest by immunoprecipitation and then recover the free peptide-CRD 
by elution using buffer containing 1 mM 1,2-cyclic inositol phosphate 
(available from Sigma). By N-sequencing the recovered peptide to its 
ethanolamine moiety, one can establish unequivocally the exact gpi anchor 
attachment site. There is a caveat: the CRD is acid labile (86, 140) and may 
be damaged during the procedure used to hydrolyze the D157-P158 bond 
(see above). If in fact the AmP-CRD is destroyed, AmP (not previously 
exposed to strong acid) can be digested, in a separate experiment, with GluC 
and then isolate the CRD-bearing peptide as described above. 

Focus on cysteine residues. 

The foregoing chemical analysis will make it clear which of the 
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potential N-glycosylation sites of wild-type recombinant AmP are in fact 
glycosylated. As also noted, some clues may be gained on the presence and 
dispositions of disulfide bonds. However, unequivocal assignments of Cys 
residues taken up in disulfide bonding will require an independent approach, 

5 such as the following. 

Native recombinant AmP, 1 nmol, in 50 mM Hepes/NaOH buffer, 
pH 8.3 (21,22), can be reacted with 1,000 nmol of (l- 14 C)iodoacetamide, ~3 
Ci/mol, at 25°C for lh. Excess ,4 C-iodoacetamide is removed by centrifugal 
ultrafiltration (10K NMWL) with washing. Specific radioactivity can be 
1 0 measured by liquid scintillation counting to estimate the number of alkylated 
C residues. The 14 C-labeled protein product can then be acid-treated to 
hydrolyze the D157-P158 bond (see above) and recover the N- and C-pieces 
(respectively, K24-D157 and P158-A649). The N- and C-pieces (with or 
without a reducing agent) are separated, and their specific radioactivities 
1 5 measured. Following procedures described above, the N-piece can be 

digested with AspN to yield the 43 residue peptide that contains C36 and a 
44 residue peptide that contains C127. The former peptide, expected to be 
glycosylated, is separated from the latter on con A-Sepharose column 
chromatography. Each of the separated peptides can be assayed for its ,4 C- 
20 content. If the N-piece is itself not labeled with ,4 C, AspN digestion and 
subsequent studies will not be necessary. 

If the C-piece of alkylated AmP is labeled with 14 C, the focus should 
be on GluC-digest peptides containing residues C294 and C299 (peptide 
T286-E303) and C531 (peptide A505-E534). T286-E303, if glycosylated at 
25 N29 1 , should be easily separated on con A-Sepharose from A505-E534. If 
not, the 18 residue peptide should be readily separated from the 30 residue 
peptide by reverse phase HPLC. The separated peptides can be assayed for 
their contents of ,4 C. Near-neighbor C residue pairs are often linked by 
disulfide bonds (185). If this is true for C294 and C299, peptide T286-E303 
30 may be unlabeled. For reasons described below, C53 1 will be labeled with 
l4 C. 



39 



BNSDOCID* <WO 9911 799 A2_l_> 



SUBSTITUTE SHEET (RULE 26) 



A parallel experiment can be conducted in which native AmP, 
saturated with bradykinin (BK) (50 ^lM; Ki 1.1 (22,1 13)), is reacted with 
14 C-iodoacetamide as above. AmP is not a thiol protease and is not inhibited 
by iodoacetamide nor N-ethylmaleimide. However, it is partially (-70%) 
inhibited by p-hydroxy-mercuribenzoate, even with the latter at low 
concentration (~ 1 0 nM) (2 1 ,22, 1 1 1 , 1 80). Given that C53 1 is situated more 
or less in the middle of the putative catalytic metal ligands, D450, D461, 
H520, E555 and E569, it is highly plausible that p-hydroxymercuribenzoate 
binds to C531 and sterically hinders substrate binding and/or interferes with 
appropriate ligation of catalytic metal to the peptide backbone. By saturating 
AmP with its high affinity substrate BK, alkylation of C53 1 by 14 C- 
iodoacetamide should be prevented or strongly inhibited. 

A third experiment can be performed in which native AmP is reacted 
with 14 C-iodoacetamide as in the first experiment. After lh at 25°C, excess 
I4 C-iodoacetamide is removed and then the 14 C-labeled AmP is denatured 
and reduced (185). The reduced peptide is treated with vinyl pyridine. The 
subsequent work up can proceed as in the first experiment to obtain AspN 
peptides of the N-piece and GluC peptides of the C-piece (pieces produced 
by acid hydrolysis of D157-P158). The relevant peptides can then be N- 
sequenced to determine which C residues were alkylated with iodoacetamide 
and which, after reduction, were covalently-bound to vinylpyridine. 

Further studies to clarify dispositions of disulfides will depend on 
results obtained to this point. For example, if C53 1 is accessible to 14 C- 
iodoacetamide and C36, CI 27, C294, and C299 are modified only after 
reduction by vinylpyridine, the obvious possibilities for two disulfide bonds 
can be examined. One can analyse existing date to discern among the six 
possibilities (C36->C127, C36->C294, C36->C299, C127->C294, 
C127->C299, and C294->C299). For example, in experiments 1 and 2 after 
hydrolysis of the D127-P158 bond, was it necessary to add a reducing agent 
to separate the N-piece from the C-piece? If the N-piece and C-piece were 
separable without reduction, the disulfides are most likely to link 
C36->C127 and C294->C299. If more than one Cys is in reduced form, 
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there cannot be fewer than three reduced Cys residues, in which case there 
cannot be more than one disulfide bond. In the latter scenario, the two 
disulfide-linked Cys residues can be identified as their vinylpyridine 
derivatives. Sturrock et al (185) have recently detailed a MALDI-TOF mass 
5 spectrometry approach for locating disulfide bonds which we plan to use if 
our simpler plans yield equivocal results. Dr. Nancy D. Denslow has 
recently developed a procedure in which a target protein is hydrolyzed by 
reacting reduced Cys residues with DTNB (36). 

If C36 immediately follows an N-glycosylation site, N35, and may, 
10 if in reduced form, be sterically-hindered and inaccessible to 14 C- 

iodoacetamide, this anomalous behavior can be clarified, if encountered, by 
reacting AmP with N-glycanase before treatment with 14 C-iodoacetamide. 
Similarly, one can examine for sterically-hindered reduced Cys residues by 
titrating native and denatured AmP with Ellman's reagent (185). 
1 5 Mutant forms of AmP. 

The baculovirus/Sf9 expression system can also be used to produce 
mutant forms of AmP. The mutants are selected to help clarify catalytic 
function in terms of roles of the putative protein shuttle, H430, and putative 
catalytic metal ligands, D450, D461, H520, E555 and E569. In addition, one 
20 can examine roles played by glycosyl groups and disulfides in AmP function. 
Initially, site-specific mutations will be introduced into the wild-type human 
AmP cDNA sequence by the PCR-based splicing-by-overlap-extension 
technique described by Ho et al (105). Incorporation of the desired 
mutations can be confirmed by directional PCR. Mutant proteins will then 
25 be expressed in the baculovirus/Sf9 insect cell system under the conditions 
established for expression of the wild-type enzyme. The cells themselves 
will be examined by catalytic assay and immunofluorescence. Mutant 
proteins will be purified and analyzed as described above for wild-type AmP. 
All mutants will be characterized by mass spectrometry, UV spectrometry, 
30 circular dichroism, quantitative amino acid analysis and fingerprinting of 
peptide fragments (mass spectrometry and SDS-P AGE with and without a 
reducing agent). Catalytically-active mutants will be characterized to 
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measure kcat, Km and kcat/Km using Arg-Pro-Pro-[ 3 H]benzylamide, 
bradykinin and Gly-Pro-Hyp as substrates (22, 113,1 80). Temperature and 
pH stabilities will be defined (180). 

The first mutant to be prepared is one in which the putative proton 
shuttle, H430, is replaced with F. If H430 is in fact the proton shuttle, the 
F430 mutant is expected to be essentially inactive. Pig kidney AmP is 
completely inactivated by diethylpyrocarbonate in a concentration that 
derivatizes two H residues per molecule of AmP (134). Activity is restored 
by treatment of the derivatized AmP with hydroxylamine. It is plausible that 
H430 is accessible to diethylpyrocarbonate. 

The second mutant to be prepared is that in which the putative 
catalytic metal ligand H520 (also likely to be accessible to 
diethylpyrocarbonate) is replaced with F. One will then proceed to obtain 
mutants for each of the remaining putative four metal ligands as follows: 
D450->N, D461->N, E555->Q, and E569->Q. One will thereafter focus on 
obtaining mutants lacking potential N-glycosylation sites. Five mutants will 
be prepared: N35-*Q, N49-»Q, N65-»Q, N278-»Q and N291-»Q. Our 
objective here is to determine whether glycosyl groups indirectly support 
catalytic activity, perhaps in terms of maintaining structure, stability and 
solubility. Should the glycosyl groups effect catalytic function little or not at 
all, it may be feasible in a future grant period to obtain a catalytically-active 
"deglycosylated" AmP amenable to x-ray crystallography analysis. 

To obtain complementary data on roles of Cys residues and disulfide 
bonds, one will prepare five C-»S mutants (C36, CI 27, C294, C299 and 
C53 1). Characterization of these mutants should reveal roles of disulfide 
bonds in maintaining higher structure. In addition, the C531->S mutant may 
help clarify the anomalous partial inhibition of wild-type AmP by p- 
hydroxymercuribenzoate (pHMB): The S531 mutant is expected to be 
catalytically-active and resistant to inhibition by pHMB. 

One plans to use both site-specific mutation and deletion mutation to 
characterize the C-terminus of AmP. For example, A648 and A649 (the 
postulated gpi anchor attachment residue) will be replaced with R residues or 
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simply deleted. These studies may also be guided by results of a parallel 
study. In the latter study, one plans to compare membrane-bound AmP with 
apparently soluble forms of AmP. AmPs in astrocytes, platelets, heart, 
adrenal medulla and lymphocytes appear to be soluble enzymes of cytosol 
5 (98 101,106,147,148,152,165,191,192). Conceivably, soluble AmP is the 
product of a different gene. However, it is also conceivable that alternative 
processing occurs such that, e.g., kidney and heart AmPs differ in their C- 
terminal sequences. To test the latter possibility, one will prepare sense 
primers to, with the antisense APT primer of the 3'RACE system, obtain the 
10 nucleotide sequence(s) of soluble AmP cDNA from human kidney AmP 
cDNA nucleotide 2070 to the poly A tail. For these purposes, one have 
prepared poly A RNA's of human heart, adrenal gland and brain. It may be 
relevant that residues 643-646 (HTEP) closely resemble a known cell 
retention sequence signal (HTEL) that directs some liver carboxylases 
1 5 to storage in the endoplasmic reticulum (158). 

Cellular and subcellular dispositions of human aminopeptidase P. 
To a large degree, the functions of AmP in integrative biology are 
likely to be determined by its anatomical dispositions. Like other 
exopeptidases, AmP is selective, but not specific, in terms of substrate 
20 hydrolysis. In these terms, anatomical distribution can be understood to 
restrict access of AmP to those substrates available in the cellular or 
extracellular compartment in which the catalytic site is disposed. Thus, AmP 
disposed on small intestine brush border epithelium could plausibly function 
as a digestive enzyme that facilitates breakdown of collagenous foodstuffs, 
25 whereas AmP disposed on renal proximal tubule epithelium may function to 
process filtered peptides so as to conserve amino acids and modulate effects 
of some peptide hormones. Thus, the first objective is to determine by 
immunocytochemistry at the level of electron microscopy anatomical 
dispositions of AmP and orientation of its catalytic site. 
30 From another perspective, anatomical disposition of a given protein 

can be a determinant of secondary or tertiary reactions conducted by "near- 
neighbor" molecules, a concept well-recognized in terms of receptors and 
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coupled signaling proteins. Given that AmP is probably disposed in part in 
specialized cell membrane domains (e.g. in endothelial caveolae) believed to 
play key roles in cell signaling (74,92,121,131,136,137,173-175), the second 
objective is to help define morphologically "near-neighbors" of AmP whose 
functions may reasonably be influenced by reactions catalyzed by AmP. 
Preparation of antibodies. 

Monoclonal antibody HL510, prepared against guinea pig serum 
AmP (22,32), is reactive with human AmP and has been used in 
immunofluorescence studies to localize AmP on human endothelial cells 
(34). In the short term, one will continue to use HL5 10 for 
immunocytochemistry; however, one will in parallel prepare antibodies 
against specific peptide sequences in human kidney AmP that are predicted 
to be highly antigenic. For the latter search, one used the EGCG program 
(Wellcome Trust Genome) to identify antigenic sequences (1 12). The goal is 
to obtain at least one high affinity antiserum to a known epitope that does 
not occur in other proteins of the "pita bread" family of proteins (59). 

AmP peptide E285-W323, which contains one potential N- 
glycosylation site and two C residues that may be disulfide-linked, will be 
the first tested. The 39-amino acid residue peptide will be synthesized by the 
University of Florida peptide synthesis facility. The free peptide and the 
peptide coupled to polylysine will be used as immunogens. The monoclonal 
antibody facility will immunize five mice with each immunogen. Antibody 
titers will be measured by ELISA. Typically, one of a group of five mice is 
superior in terms of antibody response (titer and affinity) and provides a 
basis for choosing which mouse to use for preparing hybridomas. 

Antibody isotype will be determined, and octyl glucoside-treated 
homogenate of human kidney cortex (from the National Disease Research 
Interchange/Human Biological Data Interchange, NDRI) will be used for 
SDS-PAGE and western blotting. The homogenate will also be used for 
protein A-Sepharose immunoprecipitation of AmP. Part of the 
immunoprecipitate will be denatured and subjected to SDS-PAGE to 
examine for the expected Mr 90,000 protein. The remainder will be packed 
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into a small column and then washed with 0.1 M ethanolamine to separate 
native AmP from antibody (1 13). The elated protein will be examined for 
AmP catalytic activity using Arg-Pro-Pro-[ 3 H]benzylamide as substrate (21). 
The goal is to obtain a specific antibody capable of binding human AmP at 
5 an affinity sufficiently high to enable immunocytochemistry studies and 
immunoaffinity purifications of native AmP from a range of human tissue 
sources The immediate work plan focuses largely on determining cellular 
and subcellular dispositions of AmP. Longer term, the antibodies to AmP 
will be useful for other purposes such as epidemiologic surveys for AmP 
deficiency states (62). The first immunogen, E285-W323, contains a 
tyrosine residue and could therefore be readily labeled with 125 I for 
development of a competitive radioimmunoassay for AmP. If the first 
peptide immunogen fails to yield an antibody capable of 
immunoprecipitating native human AmP, one will prepare alternative 
antigenic sequences; in order of predicted high scores: T38-T51, P582-R597 
and (if needed) L568-K578. 

For the reasons stated above, one prefers to use relatively small 
antigenic, unique peptide sequences for preparing anti-AmP. If necessary, 
however, one will use recombinant wild-type human kidney AmP to prepare 
a large peptide antigen. The N-terminal third of AmP is unique in 
comparison with sequences of other members of the 'pita bread" protein 
family (59) Thus, it should be possible to prepare a specific anti-AmP by 
using as immunogen the N- P iece of AmP formed by acid hydrolysis of the 
D157-P158 bond. When anti-human AmP becomes available, one will 
prepare an immunoaffinity chromatography matrix (32) to obtain pure native 
AmP from kidney and other tissues. Native human kidney AmP will be 
compared with recombinant wild-type AmP. 

Antibodies will be purified on DE-52 cellulose (5,6). As necessary, 
specific anti-AmP will be immunoabsorbed on antigen covalently bound on 
30 Sepharose or the original peptide synthesis resin and then eluted with 0.1 M 
ethanolamine (32,1 13). Initially, one will use second antibody conjugates 
for immunochemical studies. However, it has been argued that 
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crosslinking of primary antibodies by second antibodies may cause cell 
membrane antigens to move into caveolae (143,145,153). If the subcellular 
localization AmP appears to be influenced by second antibody, one will 
conjugate AmP directly. In our previous studies of the subcellular 
5 distribution of angiotensin converting enzyme (ACE), one developed means 
of conjugating anti-ACE to octapeptide microperoxidase via a bifunctional 
active ester (6). The same labeling procedure will be used for anti-AmP. 
Cellular and subcellular dispositions of AmP. 
Immunocytochemistry studies at the level of light microscopy were 
1 0 described above. A major objective now is to define the dispositions of 
human AmP at the cellular and subcellular levels. The need for high 
resolution studies can be illustrated as follows: Light micrographs of lung 
tissue indicate that anti-AmP is captured at sites throughout the alveolar- 
capillary unit and on endothelium of small arteries and veins. The high 

1 5 resolution of electron microscopy is required to define the actual cellular and 
subcellular disposition(s) of AmP. 

In addition to the need to distinguish which of the cell-types of the 
alveolar capillary unit possess AmP, there are two other questions raised by 
our light microscopy studies. Unlike pulmonary ACE, which is disposed on 

20 endothelium, one has detected AmP immunoreactivity in association with 
airway epithelial cells and mononuclear leukocytes. Thus, EM studies are 
needed to identify the host epithelial cells and leukocytes and to determine 
whether AmP is disposed on or within the cells. Anticipating that 
glutaraldehyde-based fixatives may mask AmP epitopes (as was the case for 

25 ACE; 5), one conducted all of the light microscopy immunocytochemical 
studies using fresh tissues (frozen sections) and tissues fixed in picric 
acid/paraformaldehyde; a fixative adequate for moderately high resolution 
electron microscopy (5,6). Further one showed that the apparent dispositions 
of AmP epitopes were not changed by fixation, and one showed that mouse 

3 0 monoclonal anti-AmP (HL5 1 0) was not inferior to mouse polyclonal anti- 
AmP for our immunocytochemical purposes. One can therefore proceed 
from light microscopy studies directly to EM immunocytochemistry of 
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human tissues using picric acid/paraformaldehyde-fixed tissues reacted with 
monoclonal anti-AmP HL510 (IgG, isotype). Antibodies to human AmP 
antigenic amino acid sequences (see above) will be prepared for final studies. 
Initially, one will use second antibody conjugates as markers (conjugates of 
5 rabbit anti-mouse IgG, and, separately, goat anti-mouse IgG,). The second 
antibodies will be labeled with colloidal gold (5 or 20 nm) (Goldmark), and 
reacted tissues will then be prepared for EM as one have described elsewhere 
(5,6). Alternatively, primary antibodies labeled with 5 or 20 nm colloidal 
gold can be used. Negative controls will include omission of anti-AmP and 
10 substitution of the specific antibody with mouse IgG, anti-theophylline (the 
latter irrelevant antibody to examine for Fc receptors). Anti-AmP previously 
saturated with AmP will also be used. Positive controls will include use, as 
the first antibody, monoclonal mouse anti-ACE (an IgM) and polyclonal 
rabbit anti-fibronectin (30). 
1 5 The positive control studies will provide a basis for comparison of 

the disposition(s) of AmP with a marker known to occur on the luminal 
surface of endothelium (ACE) and with a marker known to be disposed m 
large part in the extracellular matrix (fibronectin). AmP is believed to be 
disposed in part on the endothelial surface (3 1 ,34,39,44,46). In addition, 
20 AmP is believed to be among the enzymes that degrade collagen fragments 
produced by collagenase (165,204,205); thus, some AmP may be disposed 
near collagen matrix. To label AmP and fibronectin in the same experiment, 
one will use two differently conjugated second antibodies; e.g., rabbit anti- 
mouse IgG,-5nm colloidal gold for AmP and goat anti-rabbit IgG-20 nm 
25 colloidal gold (Zymed) for fibronectin. In addition to the monoclonal anu- 
ACE noted above, one have a polyclonal rabbit anti-ACE that will be used 
similarly for the co-localizations of AmP and ACE. 

Our mouse anti-guinea pig AmP binds human AmP (32), but at 
relatively low affinity. The polyclonal mouse anti-human AmP is expected 
30 to have a much higher affinity, and one or more of the monoclonal antibodies 
may as well. Immunocytochemical localizations of AmP will use human 
tissues (fromNDRI); kidney, small intestine, liver, heart, lymphocytes, 
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platelets, bone marrow and lungs fresh-fixed in picric 
acid/paraformaldehyde. Similarly, Clonetics also supplies human renal 
proximal tubule epithelial cells and endothelial cells from aorta, pulmonary 
artery and lung microvasculature, all of which one will use for comparison 
studies. 

Cells in culture provide special opportunities for EM 
immunocytochemistry. As shown previously, cells in monolayer culture can 
be examined in cross section and as whole cell mounts (10). For example, 
one showed using cross sections that calmodulin is disposed in endothelial 
caveoiae. By high voltage EM of permeabilized whole endothelial cells 
viewed on face, calmodulin was found disposed in tracts of caveoiae, along 
microfiloments and in cleavage furrows of dividing cells. Thus, using renal 
proximal tubule epithelial and vascular endothelial cells in culture, one can 
localize AmP bound to cell membrane and/or disposed in intracellular 
compartments. If, in fact, soluble forms of AmP are reactive with our anti- 
human kidney AmP, one should be able to localize AmP within lymphocytes 
and platelets permeabilized after fixation. 

Membrane-bound forms of AmP. 

Human kidney cortex will be the first tissue to be examined. Our 
light microscopy studies indicate that the vast preponderance of renal AmP is 
associated with renal proximal tubule epithelium, with lesser amounts being 
distributed on all endothelia except for glomerular endothelium (39). At 
present, relatively little is known of the subcellular distribution of gpi- 
anchored proteins on specialized epithelia, thus our findings on the 
disposition(s) of AmP may be instructive in terms of other gpi-anchored 
proteins, such as membrane dipeptidase (109,1 10). For reasons presented 
above, one expects that most renal AmP will be shown to be an ectoenzyme 
with its catalytic site oriented to the luminal space. Fixed tissue will also be 
permeabilized with 1% Triton X-100 to facilitate detection of AmP in 
intracellular sites (10). Proximal tubule epithelial cells in culture will be 
examined similarly and tested in addition for AmP catalytic activity. Intact 
and permeabilized fixed cells will be examined in cross section and en face. 
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Endothelial-associated AmP is expected to be disposed as an ectoenzyme 
and, as a gpi-anchored protein, may be disposed largely in caveolae 
(121,171,175). 

AmP appears by light microscopy (39) to be disposed on brush 
5 border epithelium and endothelium of the villus vascular core of the small 
intestine. Lung tissue will be examined next, as described above. 
Separately, fixed cultures of endothelial cells from aorta, pulmonary artery 
and lung microvasculature (all from Clonetics) will be examined, intact and 
permeabilized. 
10 Soluble forms of AmP. 

It is not yet known whether the soluble forms of AmP that are found 
in lymphocytes, platelets, neuronal tissues and adrenal medulla 
(98 147,165,191,192) are alternative products of the same gene that encodes 
membrane-bound forms. To gain insight into the question, one will attempt 
1 5 immunoprecipitation of these soluble AmPs using antibodies to human 
kidney AmP (see above). Clearly, if the immunoprecipitations are 
successful, one will have a basis for proceeding to immunochemical 
localization using the target tissues fixed and permeabilized. As a further 
step, one will examine poly AKNAs of lymphocytes and adrenal medulla by 
20 RT-PCR using nested primers designed from kidney AmP cDN A. The sense 
and antisense primers will be selected to cover sequence from just upstream 
of the putative proton shuttle (H430) to a downstream site just 3' to the last 
putative metal ligand (E569). The PCR product, if obtained, will be 
sequenced. If, in fact, lymphocyte and adrenal medulla AmPs are encoded 
25 over their putative "pita bread" domains as is kidney AmP, one will (as 
described above) examine by 3'RACE RT-PCR for alternative C-terminal 
sequences that may direct soluble AmPs to intracellular sites. 

Simmons has reported that human heart AmP is soluble (152), and 
this may be true for liver AmP as well. However, the possibility is not ruled 
30 out that heart and liver contain an abundance of phospholipase C or D that, 
during the homogenization process, converts amphipathic AmP into a 
hydrophilic form. Immunocytochemistry studies should help resolve this 
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question. Even if soluble and normally stored in intracellular sites, human 
heart and liver AmP must have a strong structural resemblance to kidney 
AmP: On Northern blotting using human kidney AmP cDNA that encodes 
the kidney AmP sequence R123-A478, heart and liver poly A RNAs were 
found to be highly reactive (47). Further, each had a single message of the 
size, 3 .5kb, of the kidney AmP RNA. If heart and liver AmPs are, in intact 
tissue, disposed intracellularly, it should be a straightforward matter to 
identify a C-terminal sequence signal that directs cell retention. 

Membrane-bound AmP and its nearest neighbors. 

Reactions at the cell surface can set off a cascade of secondary, 
tertiary and higher reactions that are determined in part by the physical 
proximity and fit of downstream protein reactants. Receptor activation and 
subsequent cell signaling is perhaps the clearest example 
(63,74,130,137,142), especially for receptors with seven transmembrane 
domains with intracellular peptide loops, one or more of which can be 
phosphorylated and dephosphorylated (63). There is an abundant and 
growing literature describing close chemical and biochemical associations 
between ligand-bound receptors, signaling molecules and caveolins (e.g. see 
92,130,131,137,138,160,171,173,175,183). An objective in this subproject 
is to develop morphologic means of documenting close anatomical 
associations of functionally-related molecules. 

Increasingly, it appears that reactions involving cell surface gpi- 
anchored proteins can also set off a cascade of events. Gpi-anchored T-cell 
receptor is coupled to Src-family kinases (142). An insulin-dependent gpi- 
anchor hydrolysis has been described and leads to generation of inositol 
phosphoglycan (IPG) second messengers (142). Purified IPGs alone can 
mimic insulin activities. Through the same, or a parallel pathway, insulin 
stimulates a tyrosine phosphorylation of caveolin (142). 

Previously, it was shown that bradykinin (BK), in concentrations as 
low as 10 pM, causes endothelial cells to mobilize arachidonate, some of 
which is converted into thromboxane A 2 (8-1 1). Des-Arg'-BK, the product 
formed by AmP, is the only lower homolog of BK, in a near-comparable 
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concentration, capable of mobilizing endothelial arachidonate. Possibly of 
relevance, des-Arg'-BK has as great an affinity for AmP as BK itself. It is a 
tenable speculation that a BK-dependent gpi-anchor hydrolysis exists, a gpi- 
anchor at issue is that of AmP, and part or all of the arachidonate mobilized 
5 comes from the diacylglycerol formed by gpi-anchor hydrolysis. In these 
terms, BK may exert some of its effects via AmP. 

It is now widely believed that most of the biological effects of BK are 
initiated by activation of the B2 receptor and are mediated through 
calmodulin-dependent eNOS and guanylate cyclase (see 63 and its 
10 references). If, as expected, endothelial AmP is largely restricted to 

caveolae, it is well-positioned for at least indirect linkage with eNOS and 
guanylate cyclase, which appear to be largely disposed on the cytoplasmic 
aspect of caveolae (92,131). Endothelial cells respond to BK as if they have 
B2 receptors; however the subcellular dispositions of B2 receptors have, to 
15 our knowledge, never been defined at the level of electron microscopy. 

If, in fact, binding of BK to the B2 receptor activates eNOS and 
guanylate cyclase (as opposed to binding of BK to an alternative effector), 
the B2 receptor is likely in very close physical proximity. Our immediate 
objective here is to develop a novel perspective on how BK exerts effects on 
20 endothelium by helping to define nearest-neighbors of AmP and the B2 

receptors. Such data as are available indicate that the B2 receptor is situated 
on or within a cell membrane microdomain, perhaps in caveolae, that can be 
rapidly taken up by endocytosis (63). 

It is proposed to use endothelial cell plasma membrane/caveolae 
25 fractions prepared as described previously (3,4,8,1 1). In brief, post- 
confluent endothelial cells in culture (which contain caveolae in large 
numbers (7,8,10,1 1)) will be harvested with a rubber spatula, homogenized, 
and then centrifuged to remove nuclei and cell debris. The supernatant will 
then be reacted with 5'-adenosine monophosphate (5'-AMP) in the presence 
30 of lead nitrate. Caveolar 5'-AMPase, a gpi-anchored protein, hydrolyzes 5'- 
AMP to form adenosine and Pi. Pi is precipitated as lead phosphate within 
caveolae and thereby greatly increases the density of the plasma 
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membrane/caveolae fraction. The latter fraction is then easily separated 
from soluble proteins and other membrane systems by low g-force (~100xg) 
centrifugation of the reaction mixture through a relatively dense sucrose 
cushion (4). Remarkably, about 65% of the 5'-AMPase remains active, and 
angiotensin converting enzyme (3,4) and AmP activities (unpublished) are 
readily measured. 

The resulting endothelial cell membrane/caveolae "ghosts" provide a 
number of advantages for our present purposes. Firstly, antigenic sites on 
both the extracellular and cytoplasmic aspects of the cell membrane are 
accessible to added antibodies. Secondly, the fraction contains both 
"unspecialized" cell membrane and attached caveolae. Should, contrary to 
expectations, the B2 receptor be disposed at sites outside of caveolae, one 
will find these sites. Thirdly, when of interest, the plasma 
membrane/caveolae fraction can be treated with Triton X-100 to form its 
1 5 Triton-soluble and Triton-insoluble subfractions ( 1 5 0, 1 75). AmP and 5 '- 
AMPase are expected to be enriched in the Triton-insoluble particulate. 

One can use a rabbit anti-human B2 receptor that binds to a 
cytoplasmic epitope of the B2 receptor, C361-Q395 (63). One or more of 
the serines of C361-Q395 is phosphorylated when the B2 receptor of human 
foreskin fibroblasts is reacted with BK (63). With our mouse anti-AmP and 
rabbit anti-B2 receptor preparations, one can localize the target antigens on 
human endothelial cell (Clonetics) plasma membrane/caveolae fractions 
using anti-mouse IgG conjugated to 5 nm colloidal gold and anti-rabbit IgG 
conjugated to 20 nm colloidal gold. If second antibody places gold particles 
too distant for assigning antigen sites, we will label primary antibodies (6). 

By the same approach, one can define the subcellular dispositions of 
AmP and the B2 receptor in respect to dispositions of eNOS and guanylate 
cyclase (using commercially-available anti-eNOS and anti-guanylate 
cyclase). Our aim is to develop and document a morphologic approach to 
complement biochemical data already in hand on the apparently tight 
physical association of signal transduction molecules believed to be disposed 
in association with caveolae (e.g. see 89,90,92, 1 3 0, 1 3 1 , 1 60, 1 73). In 
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addition, our approach should also help clarify anatomic associations 
between proteins disposed on the extracellular aspect of the plasma 
membrane with functionally-linked counterparts disposed on the cytoplasmic 
aspect. 

5 Success in this subproject can be exploited by us and others in terms 

of relating morphologically a host of other proteins of interest, including (but 
not limited to) the caveolins, Ca 2+ -ATPase, the H» 3 receptor, adenosine and 
prostaglandin transporters and heterotrimeric G proteins 
(90,130,132,137,173,184). 
10 Alternative Splicing. 

Complementary DNA clones encoding human membrane-bound 
AmP were isolated by reverse transcription-polymerase chain reaction (RT- 
PCR) of human kidney and lung poly(A)+ RNA. Northern hybridization 
analysis and RT-PCR suggests that the soluble and membrane-bound forms 
1 5 of human AmP are products of two distinct mRNAs which may be produced 
through alternative splicing, have different C-terminal sequences, lntronic 
sequences involved in such alternative splicing can be included in human 
AmP constructs to allow production of both forms of human AmP. In such 
constructs, it is preferred that sequences from only the specific introns 
20 involved in the alternative splicing be used. Such a construct is thus a 

cDNA/genomic hybrid construct, containing both cDNA and genomic DNA. 
The cDN A portion of such a construct lacks intronic sequences which are 
present in corresponding genomic sequences. 
Construction of Transgenic Animals. 
25 Animal Sources. 

Animals suitable for transgenic experiments can be obtained from 
standard commercial sources such as Charles River (Wilmington, MA), 
Taconic (Germantown, NY), and Harlan Sprague Dawley (Indianapolis, IN). 
Many strains are suitable, but Swiss Webster (Taconic) female mice are 
30 preferred for embryo retrieval and transfer. B6D2F, (Taconic) males can be 
used for mating and vasectomized Swiss Webster studs can be used to 
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stimulate pseudopregnancy. Vasectomized mice and rats can be obtained 
from the supplier. 

Microinjection Procedures. 

The procedures for manipulation of the rodent embryo and for 
5 microinjection of DNA are described in detail in Hogan et al., Manipulating 
the Mouse Embryo (Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1986), the teachings of which are incorporated herein. 

Transgenic Mice. 

Female mice six weeks of age are induced to superovulate with a 5 

1 0 IU injection (0. 1 cc, ip) of pregnant mare serum gonadotropin (PMSG; 
Sigma) followed 48 hours later by a 5 IU injection (0.1 cc, ip) of human 
chorionic gonadotropin (hCG; Sigma). Females are placed with males 
immediately after hCG injection. Twenty-one hours after hCG injection, the 
mated females are sacrificed by C0 2 asphyxiation or cervical dislocation and 

1 5 embryos are recovered from excised oviducts and placed in Dulbecco's 
phosphate buffered saline with 0.5% bovine serum albumin (BSA; Sigma). 
Surrounding cumulus cells are removed with hyaluronidase (1 mg/ml). 
Pronuclear embryos are then washed and placed in Earle's balanced salt 
solution containing 0.5% BSA (EBSS) in a 37.5°C incubator with a 

20 humidified atmosphere at 5% C0 2 , 95% air until the time of injection. 
Embryos can be implanted at the two cell stage. 

Randomly cycling adult female mice are paired with vasectomized 
males. Swiss Webster or other comparable strains can be used for this 
purpose. Recipient females are mated at the same time as donor females. At 

25 the time of embryo transfer, the recipient females are anesthetized with an 
intraperitoneal injection of 0.015 ml of 2.5% avertin per gram of body 
weight. The oviducts are exposed by a single midline dorsal incision. An 
incision is then made through the body wall directly over the oviduct. The 
ovarian bursa is then torn with watchmakers forceps. Embryos to be 

30 transferred are placed in DPBS (Dulbecco's phosphate buffered saline) and in 
the tip of a transfer pipet (about 10 to 12 embryos). The pipet tip is inserted 
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into the infundibulum and the embryos transferred. After the transfer, the 
incision is closed by two sutures. 
Transgenic Rats. 

The procedure for generating transgenic rats is similar to that of mice 
5 (Hammer et al., Cell 63:1099-1 12 (1990)). Thirty day-old female rats are 
given a subcutaneous injection of 20 IU of PMSG (0.1 cc) and 48 hours later 
each female placed with a proven male. At the same time, 40-80 day old 
females are placed in cages with vasectomized males. These will provide the 
foster mothers for embryo transfer. The next morning females are checked 
1 0 for vaginal plugs. Females who have mated with vasectomized males are 
held aside until the time of transfer. Donor females that have mated are 
sacrificed (C0 2 asphyxiation) and their oviducts removed, placed in DPBS 
(Dulbecco's phosphate buffered saline) with 0.5% BSA and the embryos 
collected. Cumulus cells surrounding the embryos are removed with 
1 5 hyaluronidase (1 mg/ml). The embryos are then washed and placed in EBSS 
(Earle's balanced salt solution) containing 0.5% BSA in a 37.5°C incubator 
until the time of microinjection. 

Once the embryos are injected, the live embryos are moved to DPBS 
for transfer into foster mothers. The foster mothers are anesthetized with 
20 ketamine (40 mg/kg, ip) and xylazine (5 mg/kg, ip). A dorsal midline 

incision is made through the skin and the ovary and oviduct are exposed by 
an incision through the muscle layer directly over the ovary. The ovarian 
bursa is torn, the embryos are picked up into the transfer pipet, and the tip of 
the transfer pipet is inserted into the infundibulum. Approximately 10 to 12 
25 embryos are transferred into each rat oviduct through the infundibulum. The 
incision is then closed with sutures, and the foster mothers are housed singly. 
Embryonic Stem (ES) Cell Methods. 
Introduction of DNA into ES cells. 

Methods for the culturing of ES cells and the subsequent production 
30 of transgenic animals, the introduction of DNA into ES cells by a variety of 
methods such as electroporation, calcium phosphate/DNA precipitation, and 
direct injection are described in detail in Teratocarcinomas and Embryonic 
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Stem Cells, A Practical Approach, ed. EJ. Robertson, (IRL Press 1987), the 
teachings of which are incorporated herein. Selection of the desired clone of 
transgene-containing ES cells can be accomplished through one of several 
means. For random gene integration, an AmP clone is co-precipitated with a 
gene encoding neomycin resistance. Transfection is carried out by one of 
several methods described in detail in Lovell-Badge, in Teratocarcinomas 
and Embryonic Stem Cells, A Practical Approach, ed. E J. Robertson, (IRL 
Press 1987), or in Potter et al, Proa Natl. Acad. ScL USA 81:7161 (1984). 
Lipofection can be performed using reagents such as provided in 
commercially available kits, for example DOTAP (Boehringer-Mannheim) 
or lipofectin (BRL). Calcium phosphate/DNA precipitation, lipofection, 
direct injection, and electroporation are the preferred methods. In these 
procedures, 0.5 X 10 6 ES cells are plated into tissue culture dishes and 
transfected with a mixture of the linearized AmP clone and 1 mg of pSV2neo 
DNA (Southern and Berg, J. Mol. Appl Gen. 1:327-341 (1982)) precipitated 
in the presence of 50 mg lipofectin (BRL) in a final volume of 100 \xl The 
cells are fed with selection medium containing 10% fetal bovine serum in 
DMEM supplemented with G418 (between 200 and 500 |Lig/ml). Colonies of 
cells resistant to G418 are isolated using cloning rings and expanded. DNA 
is extracted from drug resistant clones and Southern blots using an AmP 
cDNA probe can be used to identify those clones carrying the AmP 
sequences. PCR detection methods may also used to identify the clones of 
interest. 

DNA molecules introduced into ES cells can also be integrated into 
the chromosome through the process of homologous recombination, 
described by Capecchi (1989). Direct injection results in a high efficiency of 
integration. Desired clones can be identified through PCR of DNA prepared 
from pools of injected ES cells. Positive cells within the pools can be 
identified by PCR subsequent to cell cloning (Zimmer and Gruss, Nature 
338:150-153 (1989). DNA introduction by electroporation is less efficient 
and requires a selection step. Methods for positive selection of the 
recombination event (for example, neo resistance) and dual positive-negative 
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selection (for example, neo resistance and gancyclovir resistance) and the 
subsequent identification of the desired clones by PCR have been described 
by Joyner et ai, Nature 338:153-156 (1989), and Capecchi (1989), the 
teachings of which are incorporated herein. 
5 Embryo Recovery and ES cell Injection. 

Naturally cycling or superovulated female mice mated with males 
can be used to harvest embryos for the implantation of ES cells. It is 
desirable to use the C57BL/6 strain for this purpose when using mice. 
Embryos of the appropriate age are recovered approximately 3.5 days after 
1 0 successful mating. Mated females are sacrificed by C0 2 asphyxiation or 
cervical dislocation and embryos are flushed from excised uterine horns and 
placed in Dulbecco's modified essential medium plus 10% calf serum for 
injection with ES cells. Approximately 10 to 20 ES cells are injected into 
blastocysts using a glass microneedle with an internal diameter of 
1 5 approximately 20 u,m. 

Transfer of Embryos to Pseudopregnant Females. 
Randomly cycling adult female mice are paired with vasectomized 
males. Mouse strains such as Swiss Webster, ICR or others can be used for 
this purpose. Recipient females are mated such that they will be at 2.5 to 3.5 
20 days post-mating when required for implantation with blastocysts containing 
ES cells. At the time of embryo transfer, the recipient females are 
anesthetized with an intraperitoneal injection of 0.015 ml of 2.5% avertin per 
gram of body weight. The ovaries are exposed by making an incision in the 
body wall directly over the oviduct and the ovary and uterus are externalized. 
25 A hole is made in the uterine horn with a 25 gauge needle through which the 
blastocysts are transferred. After the transfer, the ovary and uterus are 
pushed back into the body and the incision is closed by two sutures. This 
procedure is repeated on the opposite side if additional transfers are to be 
made. 

30 Identification, Characterization, and Utilization of Transgenic Mice 

and Rats. 

Transgenic rodents can be identified by analyzing their DN A. For 
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this purpose, tail samples (1 to 2 cm) can be removed from three week old 
animals. DNA from these or other samples can then be prepared and 
analyzed by Southern blot, PCR, or slot blot to detect transgenic founder (F 0 ) 
animals and their progeny (Fi and F?). 

Disclosed is an isolated nucleic acid molecule encoding the amino 
acid sequence shown in SEQ ID NO:2, or a fragment of at least six amino 
acids of the amino acid sequence shown in SEQ ID NO:2. Preferably the 
nucleic acid molecule includes expression sequences, at least one intron, or 
both. Preferred forms of the nucleic acid molecule are SEQ ID NO: 1, SEQ 
ID NO:6, and nucleotides 1 to 29,271 of SEQ ID NO:6. Also disclosed are 
fragments of SEQ ID NO: 1, or fragments of the collective sequence 
represented by SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
and SEQ ID NO: 7 (the genomic sequence). It is preferred that the fragments 
contain at least 10 nucleotides, at least 15 nucleotides, at least 18 nucleotide, 
or at least 20 nucleotides. Also disclosed are aminopeptidase P regulatory 
sequences present SEQ ID NOs:3, 4, 5, 6, and 7. A preferred regulatory 
sequence is a fragment of SEQ ID NO:5 that promotes transcription of a 
nucleic acid segment operatively linked to the fragment. 

Also disclosed are proteins having the amino acid sequence shown in 
SEQ ID NO: 2 or a variant amino acid sequence where one or more amino 
acids shown in SEQ ID NO:2 are replaced with a conservative substitute 
amino acid. A preferred form of the protein has from one to ten amino acids 
shown in SEQ ID NO: 2 are replaced with a conservative substitute amino 
acid. Also disclosed are proteins including a portion of the amino acid 
sequence shown in SEQ ID NO:2 such that the protein is soluble in aqueous 
solution (also referred to as soluble aminopeptidase P). A protein having the 
amino acid sequence shown in SEQ ED NO:2 or a variant amino acid 
sequence, where the protein has aminopeptidase activity. Also disclosed are 
peptides including a fragment of at least six amino acids of the amino acid 
sequence shown in SEQ ID NO:2. Also disclosed are antibodies reactive 
with the disclosed proteins or peptides. 

Also disclosed is a method of detecting aminopeptidase P mutants 
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performed by comparing all or a part of a nucleotide sequence encoding 
aminopeptidase P with the corresponding nucleotide sequence of SEQ ID 
NO: 1, or the collective nucleotide sequence represented by SEQ ID NO:3, 
SEQ ID NO.4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO.7. Also 
5 disclosed is a method of identifying a compound that inhibits aminopeptidase 
P by bringing into contact cells and a compound to be tested, measuring the 
level of aminopeptidase P activity in the cells, and comparing the measured 
level of activity with the level of activity in cells not brought into contact 
with the compound to be tested. Also disclosed is a method of identifying a 
1 0 compound that inhibits aminopeptidase P expression by bringing into contact 
cells expressing aminopeptidase and a compound to be tested, measuring the 
level of aminopeptidase P expression in the cells, and comparing the 
measured level of expression with the level of expression in cells not brought 
into contact with the compound to be tested. 
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Publications cited herein and the material for which they are cited are 
specifically incorporated by reference. 

Those skilled in the art will recognize, or be able to ascertain using 
no more than routine experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such equivalents are 
intended to be encompassed by the following claims. 
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We claim: 

1. An isolated nucleic acid molecule encoding the amino acid 
sequence shown in SEQ ID NO:2, or a fragment of at least six amino acids 
of the amino acid sequence shown in SEQ ID NO:2. 

2. The nucleic acid molecule of claim 1 wherein the nucleic acid 
molecule comprises at least one intron. 

3. The nucleic acid molecule of claim 1 wherein the nucleic acid 
molecule comprises expression sequences. 

4. The nucleic acid molecule of claim 1 comprising SEQ ID NO .l. 

5. The nucleic acid molecule of claim 1 comprising nucleotides 1 to 

29,271 ofSEQIDNO:6. 

6. A protein comprising the amino acid sequence shown in SEQ ID 
NO:2 or a variant amino acid sequence where one or more amino acids 
shown in SEQ ID NO:2 are replaced with a conservative substitute amino 
acid. 

7. The protein of claim 6 wherein from one to ten amino acids shown 
in SEQ ID NO.2 are replaced with a conservative substitute amino acid. 

8. An antibody reactive with the protein of claim 6. 

9. A protein comprising a portion of the amino acid sequence shown 
in SEQ ID NO.2 such that the protein is soluble in aqueous solution. 

10. A protein comprising the amino acid sequence shown in SEQ ID 
NO.2 or a variant amino acid sequence, wherein the protein has 

aminopeptidase activity. 

1 1. A peptide comprising a fragment of at least six amino acids of the 

amino acid sequence shown in SEQ ID NO:2. 

12. An antibody reactive with the peptide of claim 11. 

13 . A nucleic acid molecule comprising a fragment of SEQ ID NO: 1, 
or a fragment of the collective sequence represented by SEQ ID N0 .3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7, wherein the 
fragment comprises at least 10 nucleotides. 

14. The nucleic acid molecule of claim 13 wherein the fragment 
comprises at least 15 nucleotides. 
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15. The nucleic acid molecule of claim 13 wherein the fragment comprises 
at least 20 nucleotides. 

16. A nucleic acid molecule comprising a fragment of SEQ ID NO:5 
wherein the fragment promotes transcription of a nucleic acid segment 
operatively linked to the fragment. 

17. A method of detecting aminopeptidase P mutants comprising 
comparing all or a part of a nucleotide sequence encoding aminopeptidase P 
with the corresponding nucleotide sequence of SEQ ID NO: 1, or the 
collective nucleotide sequence represented by SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7. 

18. A method of identifying a compound that inhibits aminopeptidase 
P, the method comprising bringing into contact cells and a compound to be 
tested, measuring the level of aminopeptidase P activity in the cells, and 
comparing the measured level of activity with the level of activity in cells 
not brought into contact with the compound to be tested. 

19. A method of identifying a compound that inhibits aminopeptidase 
P expression, the method comprising bringing into contact cells expressing 
aminopeptidase and a compound to be tested, measuring the level of 
aminopeptidase P expression in the cells, and comparing the measured level 
of expression with the level of expression in cells not brought into contact 
with the compound to be tested. 
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SEQUENCE LISTING 
<110> Medical College of Georgia Research Institute, Inc. 

<120> Human Aminopept idase P Gene 

<130> MCG103 

<140> 
<141> 

<150> 60/057,854 
<151> 1997-09-02 

<160> 7 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 3428 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (265) . . (2283) 
<400> 1 

caccctatcc tacactacta ggaacttgca cagtccgcct cgggcagccc aaagctcctc 60 

tgcccaccct ggctcccaaa accctccaaa acaaaagacc agaaaagcac tctccaccca 120 

gcagccaaac gcctccttct tgacgccagc ccccaccctc tgtctgctcg agcccaggaa 180 

aggcctgaag gaacaggccg gggaaggagc cctccctctc tcccttgtcc ctccatccac 24 0 

ccagcgccgg catctggaga ccct atg gcc egg get cac tgg ggc tgc tgc 291 

Met Ala Arg Ala His Trp Gly Cys Cys 
1 5 

ccc tgg ctg gtc etc etc tgt get tgt gcc tgg ggc cac aca aag cca 339 
Pro Trp Leu Val Leu Leu Cys Ala Cys Ala Trp Gly His Thr Lys Pro 
10 . « 20 25 

ctg gac ctt gga ggg cag gat gtg aga aat tgt tec acc aac ccc cct 387 
Leu Asp Leu Gly Gly Gin Asp Val Arg Asn Cys Ser Thr Asn Pro Pro 
30 35 40 

tac ctt cca gtt act gtg gtc aat acc aca atg tea etc aca gcc etc 435 
Tyr Leu Pro Val Thr Val Val Asn Thr Thr Met Ser Leu Thr Ala Leu 
45 50 55 

cgc cag cag atg cag acc cag aat etc tea gcc tac ate ate cca ggc 483 
Arg Gin Gin Met Gin Thr Gin Asn Leu Ser Ala Tyr He He Pro Glv 
60 65 70 
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aca gat get cac atg aac gag tac ate ggc caa cat gac gag agg cgt 531 
Thr Asp Ala His Met Asn Glu Tyr lie Gly Gin His Asp Glu Arg Arg 
75 80 85 

acq tgg att aca ggc ttt aca ggg tct gca gga act gca gtg gtg act 
All Trp lie Thr Gly Phe Thr Gly Ser Ala Gly Thr Ala Val Val Thr 



579 



atg aag aaa gca get gtc tgg acc gac agt cgc tac tgg act cag get 627 
Met Lys Lys Ala Ala Val Trp Thr Asp Ser Arg Tyr Trp Thr Gin Ala 
110 H5 120 

gag egg caa atg gac tgt aat tgg gag etc cat aag gaa gtt ggc acc 675 
Glu Arg Gin Met Asp Cys Asn Trp Glu Leu His Lys Glu Val Gly Thr 
125 130 I 35 

act cet att gtc acc tgg etc etc ace gag att ccc get gga ggg cgt 723 
Thr Pro He Val Thr Trp Leu Leu Thr Glu He Pro Ala Gly Gly Arg 
X40 145 150 

gtg ggt ttt gac ccc ttc etc ttg tec att gac acc tgg gag agt tat 771 
Val Gly Phe Asp Pro Phe Leu Leu Ser He Asp Thr Trp Glu Ser Tyr 
155 160 165 

gat ctg gee etc caa ggc tct aac aga cag ctg gtg tec ate aca acc 819 
Lp Leu Ala Leu Gin Gly Ser Asn Arg Gin Leu Val Ser He Thr Thr 
170 175 180 185 

aat ctt gtg gac ctg gta tgg gga tea gag agg cca ccg gtt cca aat 867 
III Leu Val Asp Leu Val Trp Gly Ser Glu Arg Pro Pro Val Pro Asn 
190 195 200 



caa ccc att tat gee ctg cag gag gca ttc aca ggg age act tgg cag 
Gin Pro He Tyr Ala Leu Gin Glu Ala Phe Thr Gly Ser Thr Trp Gin 

210 2 1 5 



205 



gag aaa gta tct ggc gtc cga age cag atg cag aag cat caa aag gtc 
Glu Lys Val Ser Gly Val Arg Ser Gin Met Gin Lys His Gin Lys Val 

225 230 



220 



ccg act gee gtc ctt ctg teg gcg ctt gag gag acg gee tgg etc ttc 
Pro Thr La Val Leu Leu Ser Ala Leu Glu Glu Thr Ala Trp Leu Phe 

24 0 245 



235 



aac ctt cga gee agt gac ate ccc tat aac ccc ttc ttc tat tec tac 
III Leu Arg Ala Ser Lp He Pro Tyr Asn Pro Phe Phe Tyr Ser Tyr 
250 255 260 



acq ctg etc aca gac tct tct att agg ttg ttt gca aac aag agt cgc 
S Leu Leu Thr Lp Ser Ser He Arg Leu Phe Ala Asn Lys Ser Arg 
270 275 280 

t-tt aac tec aaa acc ttg age tat ctg aac tec agt tgc aca ggc ccc 
Phe Ser Ser Glu Thr Leu Ser Tyr Leu Asn Ser Ser Cys Thr Gly Pro 
285 290 295 



915 



963 



1011 



1059 



1107 



1155 
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atg tgt gtg caa ate gag gat tac age caa gtt cgt gac age ate cag 1203 

Met Cys Val Gin He Glu Asp Tyr Ser Gin Val Arg Asp Ser He Gin 
300 305 310 

gee tac tea ttg gga gat gtg agg ate tgg att ggg acc age tat ace 1251 

Ala Tyr Ser Leu Gly Asp Val Arg He Trp He Gly Thr Ser Tyr Thr 



315 320 



325 



atg tat ggg ate tat gaa atg at a cca agg gag aaa etc gtg aca gac 12 99 
Met Tyr Gly He Tyr Glu Met He Pro Arg Glu Lys Leu Val Thr Asp 
330 335 340 345 

acc tac tec cca gtg atg atg acc aag gca gtg aag aac age aag gag 1347 
Thr Tyr Ser Pro Val Met Met Thr Lys Ala Val Lys Asn Ser Lys Glu 
350 355 360 

cag gec etc etc aag gee age cac gtg egg gac get gtg get gtg ate 1395 
Gin Ala Leu Leu Lys Ala Ser His Val Arg Asp Ala Val Ala Val He 
365 370 375 

egg tac ttg gtc tgg ctg gag aag aac gtg ccc aaa ggc aca gtg gat 1443 
Arg Tyr Leu Val Trp Leu Glu Lys Asn Val Pro Lys Gly Thr Val Asp 
380 385 390 

gag ttt teg ggg gca gag ate gtg gac aag ttc cga gga gaa gaa cag 14 91 
Glu Phe Ser Gly Ala Glu He Val Asp Lys Phe Arg Gly Glu Glu Gin 
395 400 405 

ttc tec tec gga ccc agt ttt gaa acc ate tct get agt ggt ttg aat 1539 
Phe Ser Ser Gly Pro Ser Phe Glu Thr He Ser Ala Ser Gly Leu Asn 
410 415 420 ' 425 

get gee ctg gee cac tac age ccg acc aag gag ctg aac cgc aag ctg 1587 
Ala Ala Leu Ala His Tyr Ser Pro Thr Lys Glu Leu Asn Arg Lys Leu 
430 435 440 

tec tea gat gag atg tac ctg ctg gac tct ggg ggg cag tac tgg gac 1635 
Ser Ser Asp Glu Met Tyr Leu Leu Asp Ser Gly Gly Gin Tyr Trp Asp 
445 450 ' 455 

ggg acc aca gac ate acc aga aca gtc cac tgg ggc acc ccc tct gee 1683 
Gly Thr Thr Asp He Thr Arg Thr Val His Trp Gly Thr Pro Ser Ala 
460 465 470 

ttt cag aag gag gca tat acc cgt gtg ctg ata gga aat att gac ctg 1731 
Phe Gin Lys Glu Ala Tyr Thr Arg Val Leu He Gly Asn He Asp Leu 
475 480 485 

tec agg etc ate ttt ccc get get aca tea ggg cga atg gtg gag gee 1779 
Ser Arg Leu He Phe Pro Ala Ala Thr Ser Gly Arg Met Val Glu Ala 
490 495 500 505 

ttt gee cgc aga gee ttg tgg gat get ggt etc aat tat ggt cat ggg 1827 
Phe Ala Arg Arg Ala Leu Trp Asp Ala Gly Leu Asn Tyr Gly His Gly 
510 515 520 

aca ggc cac ggc att ggc aac ttc ctg tgt gtg cat gag tgg cca gtg 1875 
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Thr Gly His Gly He Gly Asn Phe Leu Cys Val His Glu Trp Pro Val 
525 530 535 

qga ttc cag ccc aac aac ate get atg gec aag ggc atg ttc act tec 
Gly Phe Gin Ser Asn Asn He Ala Met Ala Lys Gly Met Phe Thr Ser 
540 545 550 

att gaa cct ggt tac tat aag gat gga gaa ttt ggg ate cgt etc gaa 
lie Glu Pro Gly Tyr Tyr Lys Asp Gly Glu Phe Gly He Arg Leu Glu 
555 560 565 

gat gtg get etc gtg gta gaa gca aag acc aag tac cca ggg gag eta 
Asp Val Ala Leu Val Val Glu Ala Lys Thr Lys Tyr Pro Gly Glu Leu 
570 575 580 585 

cct gac ctt gtg gta tea ttt gtg ccc tat gac egg aac etc ate gat 
Pro Asp Leu Val Val Ser Phe Val Pro Tyr Asp Arg Asn Leu He Asp 
590 595 600 

gtc age ctg ctg tct ccc gag cat etc cag tac ctg aat cgc tac tac 
val Ser Leu Leu Ser Pro Glu His Leu Gin Tyr Leu Asn Arg Tyr Tyr 
605 610 615 

cag acc ate egg gag aag gtg ggt cca gag ctg cag agg cgc cag eta 
Gin Thr He Arg Glu Lys Val Gly Pro Glu Leu Gin Arg Arg Gin Leu 
620 625 630 

eta gag gag ttc gag tgg ctt caa cag cac aca gag ccc ctg gee gee 
Leu Glu Glu Phe Glu Trp Leu Gin Gin His Thr Glu Pro Leu Ala Ala 
635 640 645 



2019 



2067 



2115 



2163 



2211 



2259 



agg gee cca gac acc gee tec tgg gee tct gtg tta gtg gtc tec acc 
Arg Ala Pro Asp Thr Ala Ser Trp Ala Ser Val Leu Val Val Ser Thr 
650 655 660 665 

ctt gee ate ctt ggc tgg agt gtc tagaggctcc agactctcct gttaaccctc 2313 
Leu Ala He Leu Gly Trp Ser Val 
670 

catctagatg gggggctccc ttgettaget cccctcaccc tgcactgaac ataccccaag 2373 
agcccctgct ggcccattgc ctagaaacct ttgeattcat cctccttctc caagacctat 2433 
ggagaaggtc ccaggcccca ggaaacacag ggcttcttgg ccccagatgg cacctccctg 2493 
caccccgggg ttgtatacca caccctgggc ccctaatccc aggccccgaa ataggaaagc 2553 
cagctagtct cttctcttct gtgatctcag taggectaac ctataaccta acacagactg 2613 
ctacagctgc tcccctcccg ccaaacaaag ccccaagaaa acaatgcccc taccacccaa 2673 
gggtgccatg gtcccgggaa aacccaacct gtcaccgcgt gttgggcgta accagaactg 2733 
ttccccccca ccagggctta aaaatcgccc ccacttttta accatcgtcc attaaccacc 2793 
tggtgggcat agecagaget gttcgaaccc agecagggat gaaaaatcaa cccccgacat 2853 
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ggaacccatg attcctaaac ccggggtagg ttccatgcca agtaacagca gagggagtta 2913 
agccatagga atttggctgt ggagtaagag ggaatgcggt gaggcagtgt ggaatatgac 2973 
cctaccagag gttggagaac aaacttgggc agccggaacc cgtcactatt ttagattcct 3033 
ggcattcgag gagccctttg aactttccaa agtgcagcca cagctacaat gctgttaaat 3093 
cctcccacat ttcttggatg ccccttcacc ttgtgtggac agtgtctggt ttccccattt 3153 
tacagacagg aaaactgagc ttcagacagg gggtgggctt tgcctaagga cacacaaatt 3213 
tggttgggag ttgatggggc cagatgagcc agcattccag ctgtttcacc cttcagcaac 3273 
atgcagagtc cctgagccca cctcccagcc ctctcctcat tctctgaacc cactgtggtg 3333 
agaagaattt gctccggcca aattggccgt tagccacctg ggtccacatc ctgctaagac 33 93 
gtttaaaaca gcctaacaaa gacacttgcc tgtgg 3428 



<210> 2 

<211> 673 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ala Arg Ala His Trp Gly Cys Cys Pro Trp Leu Val Leu Leu Cys 
1 5 10 15 

Ala Cys Ala Trp Gly His Thr Lys Pro Leu Asp Leu Gly Gly Gin Asp 
20 25 30 

Val Arg Asn Cys Ser Thr Asn Pro Pro Tyr Leu Pro Val Thr Val Val 
35 40 45 

Asn Thr Thr Met Ser Leu Thr Ala Leu Arg Gin Gin Met Gin Thr Gin 
50 55 60 

Asn Leu Ser Ala Tyr He He Pro Gly Thr Asp Ala His Met Asn Glu 
65 70 75 80 

Tyr He Gly Gin His Asp Glu Arg Arg Ala Trp He Thr Gly Phe Thr 
85 90 95 

Gly Ser Ala Gly Thr Ala Val Val Thr Met Lys Lys Ala Ala Val Trp 
100 105 no 

Thr Asp Ser Arg Tyr Trp Thr Gin Ala Glu Arg Gin Met Asp Cys Asn 
115 120 125 

Trp Glu Leu His Lys Glu Val Gly Thr Thr Pro He Val Thr Trp Leu 
130 135 140 

Leu Thr Glu He Pro Ala Gly Gly Arg Val Gly Phe Asp Pro Phe Leu 
14 5 150 155 160 
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Leu Ser lie Asp Thr Trp Glu Ser Tyr Asp Leu Ala Leu Gin Gly Ser 
165 170 175 

Asn Arg Gin Leu Val Ser lie Thr Thr Asn Leu Val Asp Leu Val Trp 
180 185 I 90 

Gly Ser Glu Arg Pro Pro Val Pro Asn Gin Pro He Tyr Ala Leu Gin 
195 200 205 

Glu Ala Phe Thr Gly Ser Thr Trp Gin Glu Lys Val Ser Gly Val Arg 
210 215 220 

Ser Gin Met Gin Lys His Gin Lys Val Pro Thr Ala Val Leu Leu Ser 



240 

225 



230 235 



Ala Leu Glu Glu Thr Ala Trp Leu Phe Asn Leu Arg Ala Ser Asp He 
245 250 2bb 

Pro Tyr Asn Pro Phe Phe Tyr Ser Tyr Thr Leu Leu Thr Asp Ser Ser 
260 265 270 

lie Arg Leu Phe Ala Asn Lys Ser Arg Phe Ser Ser Glu Thr Leu Ser 
275 280 285 

Tyr Leu Asn Ser Ser Cys Thr Gly Pro Met Cys Val Gin He Glu Asp 
290 295 300 

Tyr Ser Gin Val Arg Asp Ser lie Gin Ala Tyr Ser Leu Gly Asp Val 



310 315 



305 

Arg lie Trp He Gly Thr Ser Tyr Thr Met Tyr Gly He Tyr Glu Met 



He Pro 



Arg Glu Lys Leu Val Thr Asp Thr Tyr Ser Pro Val Met Met 



340 



345 



Thr Lys Ala Val Lys Asn Ser Lys Glu Gin Ala Leu Leu Lys Ala Ser 
355 360 365 

is Val Arg Asp Ala Val Ala Val He Arg Tyr Leu Val Trp Leu Glu 



His 

370 



375 380 



Lys Asn Val Pro Lys Gly Thr Val Asp Glu Phe Ser Gly Ala Glu lie 
385 390 395 

Val Asp Lys Phe Arg Gly Glu Glu Gin Phe Ser Ser Gly Pro Ser Phe 
405 410 

Glu Thr lie Ser Ala Ser Gly Leu Asn Ala Ala Leu Ala His Tyr Ser 
420 425 4 30 

Pro Thr Lys Glu Leu Asn Arg Lys Leu Ser Ser Asp Glu Met Tyr Leu 
435 440 * 45 

Leu Asp Ser Gly Gly Gin Tyr Trp Asp Gly Thr Thr Asp He Thr Arg 
450 455 460 
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Thr Val His Trp Gly Thr Pro Ser Ala Phe Gin Lys Glu Ala Tyr Thr 
465 470 475 480 

Arg Val Leu lie Gly Asn He Asp Leu Ser Arg Leu He Phe Pro Ala 
485 490 495 

Ala Thr Ser Gly Arg Met Val Glu Ala Phe Ala Arg Arg Ala Leu Trp 
500 505 510 

Asp Ala Gly Leu Asn Tyr Gly His Gly Thr Gly His Gly He Gly Asn 
515 520 * 525 

Phe Leu Cys Val His Glu Trp Pro Val Gly Phe Gin Ser Asn Asn He 
530 535 540 

Ala Met Ala Lys Gly Met Phe Thr Ser He Glu Pro Gly Tyr Tyr Lys 
545 550 555 560 

Asp Gly Glu Phe Gly He Arg Leu Glu Asp Val Ala Leu Val Val Glu 
565 570 575 

Ala Lys Thr Lys Tyr Pro Gly Glu Leu Pro Asp Leu Val Val Ser Phe 
580 585 590 

Val Pro Tyr Asp Arg Asn Leu He Asp Val Ser Leu Leu Ser Pro Glu 
595 600 605 

His Leu Gin Tyr Leu Asn Arg Tyr Tyr Gin Thr He Arg Glu Lys Val 
610 615 620 

Gly Pro Glu Leu Gin Arg Arg Gin Leu Leu Glu Glu Phe Glu Trp Leu 
625 630 635 640 

Gin Gin His Thr Glu Pro Leu Ala Ala Arg Ala Pro Asp Thr Ala Ser 
645 650 655 

Trp Ala Ser Val Leu Val Val Ser Thr Leu Ala He Leu Gly Trp Ser 
660 665 670 



Val 



<210> 3 
<211> 50000 
<212> DNA 

<213> Homo sapiens 
<400> 3 

gattcctagg ctccagaaat tctgagtcag ttggtctgag aggaggcaca ggagtctgca 60 
ttctaaatca ggacccaaga gcattttgat gcaggaccac actttgagaa taccacccta 120 
aaggatctga cctctgccta ccataaccct tcctcacccc tgccaatata ctctgatact 18 0 
gtccttttac tcctcccagt gagctgaact ctttccagcc tcagggcctt tgcatgtgct 240 
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gtctacccag aaaactcctt cctctctctt tcttgcctca gcccccatct ttgctgtgcc 300 
taaccctaag tcacccgccc tgtcccattg actctctgcc ctcattcctg ggcttcctgg 360 
tatagcatgc tggctgactg cccctgggct tgccatgtac tcttccacaa tcttcctctt 420 
cctctcttga tgacctcacg tacctgctcc acggggcctg gggaggcagg aagctatagt 480 
ggttcttcac ataggctcta atgaccagac tgcctgagtt caaatcccag ctccgccact 540 
tctagatgtg tgaccctgat caaaagttaa ttgacgtctc tgtgccttgg tttccatgtc 600 
ttttctctcc agctttattg ggtataattg agaagacaaa ttgtacgtat tgaaggcgtt 660 
tccacatctt aaaagtaatg atagcacctt tcctcatagt atcatcacga ggatgagatc 720 
agacactaca tttccaggcc ttggaacagt gcctggtgcc taagtgctaa ataaatggca 780 
gctagcgcaa ttattcccaa gccctccacg ggcctccagg cccagaaccc gcttctgctg 840 
cccttcagct cccagcaagt gactgtaccc tttactttgt gaaaatgtca aaacagcctt 900 
gccccctgca catcaaaatg tcttcagctt aacctcatcc ctcccccttt ccatctgtct 960 
gagaacaagc ggtcctcctc gccaagacca gctcttccac ctgcactttt gagcccttct 1020 
cctccaccct: ctctggactt ttccccatca accatcactc ctcctcctac atctcagtat 1080 
gggctattca cttgacaagt atttgttgag tggctcctgt gcgccctgtg ctgttcaagg 1140 
tgccggggac ctagaattaa accagacaag gttgccgctc tcatggtgcc ttcatcaaga 1200 
gggcagggca gggatgaggg tggacagaca tgcaaattaa cagtacccaa ggtcactaca 1260 
gatcctgaca agtactgtgg agagtatcaa acggggatgc actcgagggt acagcggggg 1320 
gccttggcca ggaaaagttt atttgaggag gtgatgtttt agctgagact taaacctcag 1380 
gaagccaaga caccgtatct tgatttgaat acccccaaaa agagccacac gcgcaagatc 1440 
tggcaaaaga cattctgagc aaaggaaaag agaaatgcaa aggccctaag gcagcaaaac 1500 
agctggctgt gctcagggga cagccaggca gaactctgtg agggacagga cttgctggac 1560 
aactgatgag caggaacgtg atcacagagg gcctgaggaa ccccaggcag gagcttgcat 1620 
ttcattcgac accacaggga gccatggctg gctttactca gggaagtgac gtgatgtgac 1680 
ttctgtatgc tccttctcct aaccttagga agagtaataa ctaataactc tagctgtcat 1740 
ttatcccaga tactaataca ttcaacatca cttttgcctt tacgatagcc tgataaagta 1800 
ccacattatc ccactttttt tttttttttt taagacggag tctagctctg tcacccaggc 1860 
tggagtgcgg tagcataatc tcggctcact gcaacctcca cctcccgtgt tcaagctatt 1920 
ctcctgcctc agcctcccga gcagctggga ttacaggcac gtgccaccac gcccggctaa 1980 
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gttttgtatt tttagtagtg acagggtttc 
tgacctcgtg atccacccgc cttggcctcc 
catgcccggc cattagccca cttttatagg 
aacttggtac tcggctagtg gctgaccaga 
ttaaacacag ccattgtctc cccagctgga 
acccctatcc cttctagcta catcccaaga 
gtcaaattca acttctcttt gaccttgacc 
cacccaggcc taagcttttg gttatttttg 
attgcaattc aacaaatatt tatccagcac 
tgaaggaaac caaaatgggt gagactggtc 
gtttgttttg ttttatttga cacggagtct 
cgcgatcttg gctcactgca acctgcacct 
ctccctagta gctgggacta caggcgcgtg 
ttatttttag tagagacggg ctttcaccat 
tcaggtgatc agcctgcctt ggcctcccaa 
acctggccaa tattttttta aaagaaggaa 
gaaggaagaa aagagggagg aagatgagaa 
ggaggaagga aggacctcca cggaaaaggg 
tggcccctcc tttccactgc gcatttccca 
caataaaatt caaacactca ttgcctcatg 
tcctttctaa ctaatttctc agttgccagt 
ttgatgtctc tcccctattc cagacccctg 
tccaatcctt ctgcctggca tgcaaagctc 
tgagtctccc actcgacctt ctcatgccgt 
ctttggctcg gtggtctgta ccagccacct 
cccaaacccc atcaaaccca aatgtcacct 
cccaccaccc ccctcccccg aactcctaga 
ctacatacca aaggatgttg ttgccccttt 



accatcttgg ccaggctggt ctgaaactcc 2040 
caaagtgctg ggattacagg tgtgagccac 2100 
tgagaaaact gaggctcaga gagacagcat 2160 
aaatgacttc tgagcccaaa atttgtgctc 2220 
gtgggcatga aggtgagggt gcttcttaaa 228 0 
tttcatgtca gctagattgc ctgctggcag 2340 
agcttctcct ggcatcctcc tctctcccat 24 00 
gcccctgccc cactttctaa tcagttacca 24 60 
ctgccaagtg caggctacca tgcagggttc 2520 
gttacaagac aagacagaca taatattttt 2580 
cactctgttg cccaggctgg agtgcagtgg 264 0 
cctgtgttca agcgattctc ctgtctcagc 2700 
ccaccacacc cagctaattt atttgtattt 2760 
gttggccagg ctggtcttga actcctgatc 2820 
agtgttggga ttacaggtgt gagccactgc 288 0 
aaaaaggaaa gatggatgaa agaagggcag 2940 
aaagggaaga aagcaagaaa gaagaaggag 3000 
gccgagatca gctcaaccgc agtctctagc 3060 
gcccctccaa ctcctgcccc acccccccac 3120 
taaggagtta aatcgcctga tgtatttaac 3180 
ttctctccct gtacacatca ctgcactgtt 3240 
ctggcccccc agcccacttg ccacagaagt 33 00 
tccttagtct cttccaccca cctctcaggc 3360 
tcccttccct aaactcatgc gcactctctg 3420 
ggaaccctcc agcccccacc tctccacctg 3480 
cttccaggaa gccttttccc acaacccctg 3540 
acctgccctc tgtaccacta ttttaacaca 3600 
gggactggaa gcttcaggag agtgggaacc 3660 
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aagctgggcg tattttagct tccccacaat gcctcgcaga gagtaagtgt tcaatgatgt 3720 
ttggctaata aataaagatt gtctgttaaa gcataagtgt atccaataag tgttcctgaa 3780 
atgtatatac atttaattat tcattctgag ctctattaac tttagaggag ttcatgtggt 3840 
agggggggac ttactgaatt atattaatgt aatccaagaa gataatagtt tttagcaacc 3900 
acacagtact gttattttct attgttttgt aggcaacaaa atcagccttg ttaaatttgc 3960 
tttttaaaat tgattctcaa agtttattaa aaaattagaa ctagaactac catatggttc 4020 
agcaatccca cttctagaga tatatccaaa agaattgaaa gcagggtctc aaagctatat 4080 
ttgcacaccc atgttcctag cagcattatt cacaatagcc aaaagatgga agcaccccaa 4140 
atgtccatgg atagaaaaac aaaatgtgtt agttctctac atacaatgga atatgattca 4200 
gccttaaaaa ggaaggaatt ctgacacatg ctgctacatg gatcaacctt gaggacatta 4260 
tgctaaacga aataagccag tcccaaaaag cactgtatga ttctacttct atgaggtccc 4320 
tagaatagtc aaatccatag agaaaagaag catggtgtta ccaggggctg gggattgggg 4380 
aaatgggcag ttgtttgatg ggtatacagt ttcagctttg caagatgaaa atgttctgga 4440 
gattcattgc atagcaatgt gaatactcct aacactattg aactctacac ttaaatatgg 4500 
ctaagatgat caattttatg ttttgtgctt tttttttttt tttgagtcta gctctgccac 4560 
ccaggctgga gtgcagtggc gcgatcttgg ctcactacaa cctccgcctc ccaggttcaa 4620 
gcgattctcc tgcctcagcc tcccgagtag ctgggattac aggcacccgc caccatgccc 4680 
agctaatttt tgtattttta gtagagacag gatttcactg tgttggccag gctggtctca 4740 
aactcctgac ctcgtgatct gccctcagcc tcccaaagtg ctgggattac aagcatgagc 4800 
caccgtgcct ggcccctgtt ttgtgctttt taccatgatt aacatttttt ttttatctta 4860 
agtgattttc caaaggcaag tctgatcaca tcacccctct ctgggtccca gaattcttac 4920 
ctggacccac aagccttggc ttgtctgctt gtcatggact caagcctcct cccacatcct 4980 
gccatccccc attctttcag atgagatgag gcacattcac ggtggtatgg ccgtagaccc 5040 
atcccccatt ttttcagact ggacttctgt ccttgtctgc tccttgctcc ctctcaccac 5100 
agagccaact cctcccaagg cctcagtctc tgcgaaaaca acactttctc tgaggagctt 5160 
tccctggcca cctcctagcc taggctgtaa catgtccttg tgacacgctc tccttgcacc 5220 
ctttcctctc cttcactttg taattacaca tttattattt gatcaaggtc actctctccc 5280 
accagctcag gaacgctgct gaggcagaga ccacatcttt tcctcccctt actatcatat 5340 
ccccagcagc tagcctgatg cctggcatcc agtaatcttc atcattgatt attccagtgt 5400 
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tggcaaaggc 


aaggaaaaac 


aagcccgcct 


gctgcttgga 


caagagtaaa 


taagtgtcac 


5460 


ctttctaaaa 


gcaatgtggc 


aacatgcatt 


aagagcctgg 


aagcattcac 


actcgtaatt 


5520 


tcacttctag 


gaaatcaccc 


tagaggagcg 


caatggcgta 


agtccacccc 


tgttcatctc 


5580 


aacatactca 


ttagcacaaa 


aaacaaagca 


atttacataa 


ccttcatcac 


cagaagaagg 


5640 


gtaaacacat 


tgtagatgtt 


tatggatata 


ttatggcgcc 


ttttcaaata 


atgtttttga 


5700 


ggaattgtga 


tgtggaaaaa 


tgttaggata 


taacatgagg 


tggaaaaggc 


agcatataag 


5760 


taggtatgta 


cctaatgttc 


ctaaatttaa 


aatatcagat 


ataaagataa 


tacataggca 


5820 


caaaaatagt 


ggaagtccat 


tctccaaact 


gttaatagca 


gtcaccccta 


gagggtggca 


5880 


tcatgggcag 


tcgtggcttc 


agaatgtcta 


cataggaggg 


tttagaaacc 


atcgaaaggt 


5940 


aggctagggg 


attcgtcttg 


aagctgcaag 


atggcctttg 


cttagcaagc 


agggttttgc 


6000 


cttacactgc 


atctatattc 


agagtcacta 


tgggggtgct 


gatggagatt 


aggataaagc 


6060 


ccccaagcca 


ctctctggag 


ccaatctcaa 


tttttttttt 


tttttactct 


tatttatatg 


6120 


tctggttttc 


taaatgctct 


acaatgaaga 


tgtatgactt 


ttataatcag 


aaaaagatta 


6180 


aggaaaacat 


tatttcatga 


aagaaaggca 


cttctgtgtg 


ttgggatgaa 


tgaacacagt 


6240 


gtcctgctca 


agcacatagg 


cggctggaca 


gggccttgcc 


agcacagtcg 


ctcaccgcag 


6300 


ctccctcaca 


tcacaagcct 


gccctccccc 


tcgcacccgc 


aggcatctgt 


cctccagctg 


6360 


ctgacacagc 


tgggcagcaa 


aggtcccatc 


ttgactgtgc 


acctcggagg 


cagcaccgtg 


6420 


gagcatggag 


ttctaaggca 


tgtccacggg 


ccaagcacac 


ttgtaaaggc 


catttgacaa 


6480 


caatgggcta 


tacccctcta 


cctctaacgg 


tggccctggc 


tttgaagagc 


aaggatcgtt 


6540 


ttctgattgt 


agctgtggcc 


actagatggc 


agaggagaat 


gacggtttcc 


agttgtataa 


6600 


taaaattggt 


atttaaagct 


gcagttactt 


ttcttatttc 


atgatcattt 


ctattttggg 


6660 


aaattcggtt 


gacttttctc 


cttacagtct 


cttacttatg 


gccttaaaag 


tccacaaaaa 


6720 


caaaaacttt 


tttttttcag 


aactaggtta 


aaagtatcta 


aagttcatgg 


aagtttcagc 


6780 


actcttcatt 


gaacacaggt 


agaattaaat 


cacattttcc 


tctcttgcct 


ctttcactgt 


6840 


ccatcaatgt 


ttcaagttat 


tttgaaagtg 


tcaacaatgg 


ttgggaggtg 


atctcactgc 


6900 


tgaaaggtaa 


tgacttcgat 


cttctctgga 


gaggggtcag 


ctgggaagga 


acgatggaat 


6960 


caatcgaggc 


agttggagag 


gcggggagaa 


acacagttgt 


gacctgggaa 


gaaacagact 


7020 


aatccctggc 


cagaagatgg 


cacagcaaag 


tggagaggaa 


ggagaccaaa 


agggaatggg 


7080 
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gcttgttaca ggttcagttg tatcccccca aaatcgatat gttgaagtcc taatccccag 7140 
tacctgtgaa tgtgacctta tttggaaata gactcgtcac agatataatt agttaagtaa 7200 
gatgaggtta tactgggagt agggtgggcc ctaatctaat ataactggtt tccttataaa 7260 
aaagggaaat ttggacagag acacacacac agggagaaca ctgtgtaaag gcaaaggcag 7320 
ggatcagggt gatgcatctc taagccaaag aatgccaaaa attgccagca aaccatcaga 7380 
agctcaggat gggcatgaga cagattctct ttcacagccc tcagaaggaa ccaaccctgc 7440 
tgacctcgat ttcacactct ggcctccaga cctgtgagac aacatgtttc taagcctccc 7500 
actgtggtac tttgttactg cagccctggg aaactaatac aaggattttg gagccatgga 7560 
atagtggtaa aatcgtaata gaatttcttc actcacttct aggactgttg gacctgatac 7620 
gtgggggcga tggctgaagg atagaagttt cctcctctac cattctatca tcccctccac 7680 
aatcacatcc ctcatttctt ttcttttttt tttttttctc tctttttttt tttttgagac 7740 
agaatctcgc tcccgttgcc caggctggag tgcagtggcg cgatctcggc tcactgcaac 7800 
ctctgcctcc caggttcaag tgattctcct gcctcagcct ccttagtagc tgggattaca 7860 
ggcacatgcc accacgcccg actaattttt cacattttta gtagagacag ggtttcacca 7920 
tgttggtcag ctggtctcga actcctgacg tcaggtgatc cacccacctc ggcttcccaa 7980 
agtgctgaga ttacaggtat gagccacatt tcttgaacga gttgtctcca cttgttgtcc 8040 
ccactctctt acctccactc acccctcaac cctttctagc ccattcctct gacatgattc 8100 
tcaccaaggt caccagcaac ctccatgctg ccaaaaccaa aggtcgtttc tcagtcctca 8160 
cctaacccaa cctctcagca gcattccacc ccttcttcag gaacactccc ttcaccgggg 8220 
cgcaggaact ccacactctt cccagagttc ctgctgcctc accaaccact ccatctccat 8280 
ctccattgca ggctcctcta ctttcttggc cccgccacta gatgatggca tgtgctgagg 8340 
tttaggcttg ggtcacttct cctcttactc tatactctct ctctcgctcc ataaacccca 8400 
ctcactcctc tgtctgcagc tgtctccaaa ctctatctga agcagtctgt tgctgccacc 8460 
acacacctca gacttgttta cagtggcatt acatcctatt caggtttaca tccttttttt 8520 
tctttttttt tttttgagac agagtctcga tctgtcaccc aggctggagt gcagctgcat 8580 
gatctcagct cactgcaacc tccacctccc gggttcaagc gattcccctg cctcagcctc 8640 
ccgagtagct gggattacag gcgtgcgcca ccacgcttgg ctaatttttg tatttttaat 8700 
agagatgggg tttcaccatg ttggtcaggc tggtctcgaa ctcctgacct tgtgatctgc 8760 
ccgtcttggc ctcccaaagt gctgggatta caggcgtgag ccactgcgcc tggccaacca 8820 
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tgctcatttg tttatatatt gtctatactt 
gttgctacag agactgtatg gtctgcagag 
gaaagtttgc caaactttga tctagaccaa 
tgcgacccag gacagctttg caggcagccc 
tatgagatgt ttttgtgatt tgattttttc 
tcgttagtgt attttatgtg tggcccaggg 
actgtggtct cagccttcac cactgaaggc 
gatgggaagg gctccctggc agaacctccc 
ggggtggagc cacataagtt cctgtcattt 
cctgggtgga acctgagatt cagcaagcgg 
gccgtgatcc cagcactttg ggaggctgag 
agaccagact ggccaacatg gtgaaacccc 
ggcgtggtgg cgtgcacctg taatccagct 
accaaaactc gagagaaagg agggggcttg 
gccagaggac attggaagtc aaagggaggc 
tctgcactac atcctgaggt gtccccagtc 
agagagggag ctgagtttcc tgagccaggc 
gctctggccc acaaagtcat ctggggtctg 
ggatcctcac ctcttgccat acacaggctc 
tgggggttct ttctgactgg ctgtgacccc 
acagccaccc tctatggcga ccagccctac 
gaacccaatt gtgagattcc cctctgcctc 
ggcaagtgtg ggcgggtcct ctggcacatt 
atctggaagg ctggggattt cggaatctct 
tcccaccaag aatggagacc tcagggccta 
ccacctcctt accagcgggt ccactgcaca 
gagtgccaac agcactcctg acggatgggt 
ctgccagggg agaatcagga atgaaaagct 
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gctttggggc tgcaacagca gagttgagtg 8880 
tctaaaatac ttactatctg accctttaca 8940 
gcttgtccaa ccagtggccc gtgggctgca 9000 
aacacaaatt cataaaattt cttaaaacat 9060 
tttctttttt tttctttagc ttttcagcta 9120 
aagccaaaag attggacagc cctgatctac 9180 
ttggggtccc ttaacatagt cagacagcca 9240 
acggcctgcg cactgggaag aatgcgaagt 93 00 
gcagccggga ggcgccaggc ccctcctctt 9360 
agacaactct ttcaagaaat gtggctcacg 9420 
gcgggtggat cacctgaggt caggagttcg 9480 
gtctctacta aaaatacaaa aaattagccg 954 0 
actcctcagg ggatatagta aagactaatg 96 00 
ccattcctag ggcatggctc accatctgct 9660 
accagcagtg ggtcagtgac agcttcagcc 9720 
ctcatagcac atgcctgcag tctgaagaca 9780 
tcctgttcag tcaccccaga ccagcttcaa 9840 
gttgtctctc agctcccctc cttgggacat 9900 
cagtgtggaa gggatacagg atggggcatt 9960 
agagagggag gtgtcatgct ggagagttgg 10020 
cacccggcct ggaaacatgc ccactgtggg 10080 
accccagttt tctgggcgga gatgtccaca 10140 
aagctttatc tgtaggctgg tacctatgaa 10200 
gacccattca acctggagca tcttgctgag 10260 
gttgtttgat ttgcgaaatg tcattttagg 10320 
aatgtcttgc tcagatcctt aagagctgag 1038 0 
ggccagcagc agaggcagga gccctgtgcc 10440 
ttcccagtgc tggctgggcg cggtggttta 10500 
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cgcctgtaat cccagcactt tgggaggccg agctgggtgg attatgaggt caggagatca 10560 
agaccatcct ggctaacatg gtgaaacgct gtctctacta aaaatacaaa aaattacctg 10620 
ggcgtggtgg cacgcgccta tagtcccagc tagttgggag gctgaggcag aagaatcgct 10680 
tgaacccagg aggcaaaggt tgcagtgagc ccagatcgtg ccactgcact ccagcctggg 10740 
tgacagagcg agactctgtc taaaacaaac aaacaaacaa aaaacaactt tcccagtgct 10800 
tacaaatcca tcttccatct cacctcggcc tgcagtgtgc tgtgtgacca ctaagaggca 10860 
ctgtgaggtc acaagaagct tggagaagcg cggccaccat ttcagtccca gcctccggga 10920 
agtgagaaaa cccctaggga aaggcgcagg attctgggac tctttgggac atccctccct 10980 
ggcagaaagg atctattcta gtcagcagtg gggacctggg ctgggcacct catgctggcc 11040 
gttgcgaagg gtggtgcaaa gaggaaaaga acagagacct aaaggggccc taattcatcc 11100 
caggcaccgg ccactggcag atgactggcc caaacaagcc cggagggcca cactgcagaa 11160 
ccaagcagga ggcggagccg agcagggaag gcgggaccct ggaggacgtc ttggctcctg 11220 
gcttcgcggg ttctttggga tgttttagga aggactcttg acctccagat gtgtggtggt 11280 
tggaggtatg ggatgcagtg gaggacccaa cacggaacaa acgggagact gggtgttgtt 11340 
ttgacacctc cctttcatgt tctcacgtcc atgtcatcgc caagttcttt catttctacc 11400 
tcctgaatat ccctggaatt tttccccttt gacccaaatc tgctccctcc acccttgtct 11460 
aagccaccat catctctcct tgatgtctcc aagatacggt tgccgcataa aatacaggac 11520 
acccagttaa atttgaattt cacataaaca cagaataatt tttagtacat cttgtacaat 11580 
acttgggatg tacttatact aaaaaaccat tgtctattat ctgaaattcc catttaactg 11640 
ggtatccggg gtttggtttt gtttcactct tttgttttgt ttctaaatct ggcaactcga 11700 
ccaagagttt tcccagccct gtgcctcttc aaggctttct ctacagagca gccagaatga 11760 
acatacagaa attcaaatct acccaggtca ctgccttgct taaaaccctg accacctccc 11820 
agctctcaac tccatcttcc actgtttccc taagcccaca gcatgccccc agcaaaccaa 11880 
atacactgtt gcctcttaaa tgtgccccac atccttctac acactcccat ccctgtccag 11940 
ccacctaaag aattcaaaca tggttatcaa actcaactac cacccctttt tgtcaggctc 12000 
ttccagacat gtcgcctgcc tcccccaagc caagcacgcc ctctttgcac tgtatagtag 12060 
ccttttgtag agcacttgtt tagctgtatc taaatgattt gtttaggtgt ctgacttccc 12120 
ccactagact gtgtcctcct tcaaggaagg ggcccagtgt tattcatctt tgcaaccaca 12180 
gtgcacagca cagtgcgtgg cccagagtgg ggcattcaat gaaggaagga aagaaggaag 12240 
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gaagtaagga 


aggaaggaag 


gaaggaagga 


aggaaggaag 


gaaggaagga 


aggaaggaag 


12300 


gaaaattcgt 


gattccaaat 


ctgaagccta 


gatgggccac 


tggaagtgac 


tgcaatgaca 


12360 


gtacaggaca 


ccagggggca 


gaataacctc 


agttttcttt 


aagggagagg 


ggcgtggtac 


12420 


agaaagccga 


cttcttggct 


ctgggcttcc 


ttctccagtt 


ccaaagagaa 


gagttttatt 


12480 


gttttgtttt 


gtttttaaat 


cacggaaaca 


taaaacaaga 


atggacctta 


agagaccatg 


12540 


ggaaccagtc 


ctctgctttt 


tcgtagttaa 


aaaattcttt 


gcatttccac 


ttaactatct 


12600 


gttttcaggt 


ggtgaaaaac 


atcctttggt 


ttgttacaga 


caaatcacag 


agaggttaga 


12660 


ggacttgctc 


caaattaccc 


agcaagtgag 


cagtacaagg 


aggcctggag 


cccagtgttc 


12720 


tgcagacatg 


atcccatttc 


atcctcactg 


cagccccatg 


agcgaggcac 


tattgttgtc 


12780 


ttacaggtgg 


ggaacataga 


gaggttgtca 


tctgactgga 


tcacaccgtg 


gtagagcagg 


12840 


gagctggatg 


aatgcaggca 


ggtctccaat 


cttcctttct 


agtgtcccta 


cctctcttac 


12900 


actctcaaat 


ctttgcttgg 


ctcttttttt 


tttttttttt 


tttgacagag 


tctcactctg 


12960 


tcgcccaggc 


tggagtgcag 


tggcgccatc 


tcggctcact 


gcaaactctg 


cctcccgggt 


13020 


tcaagtgatt 


ctcctgcctc 


agcctcccta 


atagctggga 


ctacaggcgc 


gtgccaccac 


13080 


acccagctaa 


ttgcrtggct 


tttttggggg 


ggttgggggg 


tgggtgctaa 


ccccagat cc 


13140 


ctacatcctc 


agcactcaag 


cccagaacgt 


cttcatggga 


ctgacagctt 


ctgtgaatcc 


13200 


cgtggggaac 


cactgggcat 


cgacccaggc 


cctgtgacag 


ctacaaatag 


gtaagacaaa 


13260 


tgagggtatc 


tgggatgcag 


actctaggga 


ggcattcctt 


caagggcagg 


ggtatgcctg 


13320 


agagtgagcc 


cctcctgaca 


ctgtgtgccc 


cagggtcctc 


acttgcctcc 


ctctaacctt 


13380 


ggccctgggt 


gcagtatttg 


aacagaaacc 


ctgttctcct 


ttcctccaac 


tgctaggcca 


13440 


gtctaactat 


ggagaggtct 


caaggaggca 


ggagccactt 


gagctcttta 


aggaaggtga 


13500 


gatggatggc 


aagtgaccac 


cagagactgt 


gttccgccct 


gtataactgc 


ttcattgtag 


13560 


agcctcaact 


caatagggta 


caaacacaca 


atgcttttct 


ctttccacag 


tcatgaacca 


13620 


tcttggcagg 


gttctggagg 


cttccccact 


tcagagttcc 


ctgacatggg 


agaagctatt 


13680 


tggccactac 


ttcttccaac 


caacccaatc 


cctacccaca 


cctatgcaca 


cacctgtccc 


13740 


cagcagtatc 


tacaaagacc 


tttgttgttg 


aggtcccctt 


gccctgaagc 


cagtcctctt 


13800 


gaacaggaca 


aggtaagaca 


actcaaatgt 


gtgacctttg 


gaggtgtccc 


tctgacctga 


13860 


ggaaagcgca 


catggctggc 


tacaccactg 


ccgcctctgc 


tctttgccct 


ctctccactc 


13920 
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ccctttagtt gttctcatag gctcagggca gagtcacaag gctgccaccc aagcagatgt 13980 
ccaaccagga atgagataac atgtcccact tcacagcctg accccctatg ccgtcccaat 14040 
cttgatggca ttctcttgga acccctcact tggccacaac actgggccaa agtctcctga 14100 
tcatcccagt ggtgccccac cactactttc tgcttatcta gacggggtgg gagcagtaga 14160 
acctgtttgc taagctctca ataagccctg aaggaagcat tggtccccta tcattggaag 14220 
ttccttggca cttctttctt ggaactagtc ctcagctttg ggcccttgtt gccagttctc 14280 
aggtctatag aagtcccact cacatcctgt tgaatgtcaa ttgcataatt ttgtatccct 14340 
gtgttccaac tcctgaggac tggagggaaa atgagtgtcc cctcttcacc tcagctattt 14400 
tacattggat ggaaatgggg catagagtac caaggtgcca agaatggagg aaaggataag 14460 
ctgggagcca ttctcaagta aaagggacct tggagtagtg gaaagagtaa ctcattgtca 14520 
ctcactggat gcatgccttg tgcaagtcac ggaactccct gagattcagt ttcttcatct 14580 
gtaaactgca gataacacaa atgtcccagg aatgtagtaa agattagatg agatcatgta 14640 
tgtagaagcc ttatataaac tgtaaagcac tgatcaaagg agtgaagcta attattaatt 14700 
agttttaaag aagccatccc taccatcaaa gatcttgcag ggattgggta gcaattcata 14760 
agagattatg acagagggct gagctgtggg aggcagtaga gagcagttgt taagagcagg 14820 
gaccttggtt tctagctctg gcacatccta gctgtgtgac tttgaataag tcactgcccc 14880 
tctctgaacc tcagtttcct cagatgtaaa gtaaggatga taacacctcc ctcaggacct 14940 
gggaggatcg gatgaaatgg tacacaagaa gaatgagcta gcacagggct taatacatag 15000 
tggtgcccaa taaatgttgg ctgtagctag ggttattaag agatcagcag ccattacaaa 15060 
gggaaggagg ggtcagggaa aatctcctat gggagattag gcttgaattt ggccttgaaa 15120 
acaggagaca aatttggaat gtcagaaaaa gttgcacata agtaagtctt gccaagcact i5180 
gggggaacaa aaaagtaagg ccccaaagtt tacttccatc ctctctccta gatggtctga 15240 
aaagatggtc atgcaggtgg cctggcccaa gaggggcatg ccagtgagaa gccataggga 15300 
aactgacata gtgacatggc cctcctactc ctgctgaaat tcaaacctct tcacaggcag 15360 
tcgagggaga accctgggcc aagctgcgcg gtctacctcc tactttgtga cccagggtgg 15420 
ccctgaggcc ctgggcctgt ccctctagcc ctcccctgat aaatcagtcc ctaacatgtg 15480 
cctcttggtt gccagtggat aaatgcatga ctccttccag ggaactggtg tgcatttcag 15540 
cgggggcttt ctagaaacaa taccttacct cccagagttt cccaagagca gctggagctc 15600 
ttgttttctt acctgaaacc tcagtggatt cctccagcag tttaaatctt catcgtttgc 15660 
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tgaatcccag gacccaggaa 
gccaccttca gccttggaga 
gcttttgctt ttgtttggac 
tggtcagaat atgtggagtt 
aacacaagaa aggcttattt 
agtaaggcca acctgaaaca 
ggagaagtaa atagtcttag 
gcttaaagaa gaggaattca 
actggcagat tcagtgtcag 
gctgtgtctt cccatggctg 
ctaatcctat tggatcaggg 
tacagccaca ctggcggtct 
ataacagcac accaactctt 
actctcttgg tcagaattag 
gtaggggagc aaatatttgc 
cttgaggaag aaggaaagtc 
aacacacacc tgggactgtc 
ctgaatagaa ttcttggggg 
tttcgaggtt tactatgtct 
atcaatagac actcaggaaa 
tttaaaaatc aacatattga 
tttattttga gacagagtct 
gctcactgca agctccgcct 
gctgggacta caggcgcccg 
ggggtttcac cgtgttagcc 
cagcatccca aaatgctggg 
tttttttttt aacaatcatt 
gtcaaatagg aatcttcagg 
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gcacagtttg cacctgtctt aatacccagg actcactggg 15720 
caagatgggc cagagggaga acagctggtg gcccacatct 15780 
aaagatgatg tgccatgtcc ttttctctgt cactgtgcat 15840 
atgctgcaat aacacagaaa ccctgaaatc tcaggggttt 15900 
ctggttctca caaaagtcta gtgcaggttg gatgacacgt 15960 
tgtggcctcc tagaactcca cagcaggggg aaagacagat 16020 
tcagcatggg ctgtcataac aaactagcat agactgggta 16080 
tttctcacag ttccacagct ggaaagccca cagtcaagac 16140 
gtaagggcac tctttctggc ttgcagatgg ctgacttctc 16200 
agagagagct agctctctgg tatctcctct tacgaaggca 16260 
ccccatactc atttaacctt aattacttcc taacaccaaa 16320 
agtgtttcaa tatatggata agggggacac aattcagccc 16380 
aactgtcttg gcccagaaat gatgcacatt aagttcactg 16440 
tcacgtggcc caagcttaat tgcaagaaag gctgagaaat 16500 
tgagcgcaat ctgtgccaca cactgtcttc taggctgagc 16560 
tcgaaagtca gaacagtggc tcggatgtaa agtgtaagga 16620 
tgccaggcag gttcaaggaa ccaagaccag gaacctggtt 16680 
aagagggggg aggagggaac ttgtccctgt gtttggtata 16740 
tctgcccagg atgtgaacac aaatccttca tcacagatat 16800 
ttcagatgaa ttaggcagag atcagagttt ttattctgtt 16860 
atgtttcccc caaaactacg tgttttattt tatttattta 16920 
cactctgtca cccaggctgg agcacagtgg cgcaatctcg 16980 
cccgggttca cgccattctc ctgcctcagc ctctcaagta 17040 
ccaccatgcc ctgctaattt tatgtatttt tagtagagac 17100 
aggatggtct caatctcctg accttgtgat ccgcccgcct 17160 
attacaggcg tgagccactg tgcccggcca acgtgttttt 17220 
ttccactaca tagcaatagg aagagggtta acacaattat 17280 
agcgagcttt agtgaataga taagcctgat atgcaagcag 17340 
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cacagccatg gttgatttct aacttgctgg tgggtctgtt tttttaaaca gaagaccctg 17400 
ggttcctttg gtgggtactg ggaggcagtg tatcaaaccc actgtctcat agcccaaccg 17460 
ctatatgaac tagttcaaag cttgaattgg tgaagtccaa aaaagagaag ctttgctttt 17520 
ccaacatgtt cagaactgtg atgggaaaat ggttgactaa gtgccacatc tttcctctgc 17580 
tcgaatagaa tgtagatcat tgagccatgt gtagtgacaa ggtgtttttg ggaaaggtct 17640 
caatgctatg aattgttatt gtgctcatga ccctccctta agatggcgta aacgccctgg 17700 
catgatcagc ccacacctgc ctctttaacc ttatctctct gcctaatccc atagggacac 17760 
cagcctccat tctggtccaa gaggccccaa gtcctttgct gccctggggc ttttgtgctg 17820 
gctgttctct ctgcctggaa tgttcttcct ccagattttt gtgggctgct tccttttcat 17880 
ctttcaggct cagctcaaaa gccactagct catagaggtc tgacaaccca tataaagtca 17940 
aagtatctcc cctggtaccc cagaaacaca tatactgttt gtcatcaccc agtttatttc 18 000 
tttgttaata aatatcacag tcatccccta tcttgttcat caattactta cttctgaatg 18060 
tcttcggaga aatttcttag ggctgagcct actagactgt gagtttccca agggaaaata 18120 
gggactgagg tctctatagc ctttgtctcc tgcaaggtag ggaagaggca gtgatgactc 18180 
atgtttaaga atacttgagt tctggctggg catggtggct catgcctgta atcccagcac 1824 0 
tttgggaggc cgaggcaggt ggattgcttg aggtcaggag ttcaagacca gcctggccaa 18300 
cacggtgaaa tcttgtctct actaaaaata caaaaattag ccaggcatgg tggcgggcgc 18360 
ctgtaatccc agctacatgg gaggctgagg caggagaatt gcttgaacct ggaaggcaga 18420 
ggttgcagtg agccaagatc ataccactgc aatccagcct gggtggcaga gcaagactcc 18480 
atctcaaaaa aaagaaatac ttgagtccta aagtagtagt agagttatca aattcccagg 18540 
tgaaagacac tcctgaggtt tcattgagca cctactgtgt ataagtttag catccactat 18600 
gttctagggc tctaggctgg catggtcctt gcctgcctcc atggaacttg cagttccaat 18660 
ggaaaaagca taaatcaagc aattactact tttaattatc atcatttaat aataatgtac 18720 
tgatagtagt agcaatggct aactcatatt atgcttatgt gttttgtgct gggcacaggg 18780 
aggggggtta tagatcctct ctgaagacac acagtgtcca gcacaaaaca agacaaataa 18840 
taagagacat ttatatagtg cttattatgt accatgcatt gttccaaaca ctttgcatct 18 900 
attaactcat ttaattctca cagtggccct atgaggtagg taccattatt attcctagtt 18960 
ttacagatga ggacattgag gctcagagag gtcaaacagg ttgctcaagg ccacacagcc 19020 
agtgagggtg aaacgagcta ttattagtca ctacctggga attcccctgt ttttgttgtg 19080 
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gttgttgttg atctgggttg cttaggaagt gctacatcaa ttcctacagc attttagatc 1914 0 
ctgtctgctg atagttgtgg taaacaaaca ataccctatt agatgctgct gatttttctc 192 00 
attgattttg tatatgattt aatagttttg aatctcagtc agaagtgaag attacctgtc 19260 
agacactatg ctcactacct gggtgacagg atcatttgta caccaaacct gagtggcatg 19320 
tgatttaccc atgtaacaaa cctgcacatg tactcccgaa cctaaaataa aagttgagag 19380 
gaaaaacaag aaaaaaaaag agagtgaaga tttcaaaaag tcatcttggc agggaagtca 19440 
tgttgacaag aattttcttt gttctatgtt aggatctcgc gctacccagt ttaatgaatt 19500 
ctatgtagaa atgttcccaa acaccgtgtg cttccctcaa ggaatggctg aaaatgcagt 19560 
agaatgtaaa tatcatcatg ttttgtagac tgctcacaaa tcatcgaaat cagtgaattc 19620 
attccagcaa attcttcttg agctccaact atgtggaagc tacaattctt ggtactggga 19680 
ttacagcagt gatcaaaaca gcctaaagcc ccctgctttc atggtacata tgatataata 1974 0 
gagcgagaca gaaaataaag aaaatgaata agtaaaaatt gtcatttgat gataaatgct 198 00 
attagagaaa cataaagcat agaaggggaa tagggagagt gggggttggg tgcaacttta 19860 
aatagggtga ccagggcagg actcactgag gaagtaactc atgagcaaag ctctgaaggg 19920 
agtgagggca tttccagtag tccagcttga ggccctagta tatgtgagac ctgaggcatt 19980 
ggcctggctt gccctagatg ttcctcttct ttttgagcct gcttgtgttt gtcggctatt 20040 
gggtgaggct tggcacaagc agttgggccc aagggcaccc actcatccct cctgggcttg 20100 
gaaagctgct gtgctgaaat gaccccaccc ccatgcccct cctggggaca ggggacagtc 20160 
aaagagccac tagaggctcc caaaactctc aaaggggcaa accctctccc cagaacctta 20220 
tcagtttcct ggtgcaggca ctgctgagga cctcagctgt tgtctgtcct tctgggtttc 2028 0 
tatgtttagt gtgaaggcag ccctggcaca ggccttcctt ccccttccaa tccgcaacag 20340 
cctgtgctct gcattgcagc gtcacttccc tgctggaggt tgcaatgttt gcaccaaaaa 20400 
gatggctttc aaagcccaag gtgtgtgtgc ggggggaggg gtatgtgcac acacacaatg 20460 
gaaggggggc tgtatctcag accctcacat ttcatggatt catatgtctt ttgctctgag 20520 
cataaggcta ttgaagtcac agtcatggat tcctgttctt tcgtttattc aacattacag 20580 
catggggtta ccagcctgac tactagcatc acattgactt gaccttgaat attagcccaa 20640 
ccatctgctt gccatacgac cttagacaag ctacctgtaa cctctctgag tgttattctc 20700 
atctgtaaaa tgggactcaa atgcagtacc ttccttgtaa ggttgtcaag attaaatgag 20760 
atcatctttg aaaggcattc agcaccatgc ctggcacaca gtaggtgtct cataaatgac 20820 
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agctgctatt actactaatt agccagggct tactgcacac ccattgtgtg ccaggcacag 20880 
tgctaggtgc tggcagagtt agcaagagga actcaaagat gagtgagaca aggcccagct 20940 
tggctccatt cctcacccac atgtggctgc cccaaggcag tcacctcatc caagggacct 21000 
tgcagctgag tgtgcaagca cctgggccct ggcgtgggac ccagtggggt tcaggccagg 21060 
gagtgccacc agatgggtgc aacacccata cccttaccct cacaaatgcc ccccgcccca 21120 
cgctgccaga atgtcacctg agtgctgagt gccgggcctg ccccacaggg cattgggacg 21180 
gagggcccag caccaaggca aggagctgct ctgtgggctg gtctggacac agtggagatc 21240 
tgggtcagcg tttttctgag tgaattcttt tttttttttt tttgagtcgg agtctccccc 21300 
tgttacccag gctggagtgc agtggtgcca tcttggctca ctgaagcctc tgcctcctgg 21360 
gttcaagcta ttctcctgcc tcagccttcc gtgttcaagc gattctcctg cctcagcctc 21420 
ccgagtagct gggattacag gcatgtgcca ccacgcccgg ttaatttttg tatttttagt 21480 
agagacaggg tttcgccatg ttggccaggc tggtcttgaa ctcctgacct caggtgatct 21540 
gctggccccg gcctcccaaa gtgctgggat tacaggcatg agccacggtg cccagccttg 21600 
ggtcagtgct cttctgttta ccctcttcag tgtctcagcg gttggcaagg catcatcact 21660 
ctctaccttt tcttgggaag tggattcccc tccctgtcag agactcttcc tattattttg 21720 
gggccccttt ctctttcaca ctcattcaca cactccccaa tctatgtcct cagtcactac 21780 
tagccctagt ggtcagtgga tccccatggc gaaggggaag ctcattggat catagcaata 21840 
tagctcaccc gcccacaggg cgtggcccac tggaagaaga ggagaggaca ggacaggact 21900 
tggcctctgc cttgcctgat ggattgtgtt atctcacgca ggagggagtg gggaaaccca 21960 
tgtggcattc gatacacagc ccagtgctaa actatgcagt ctgagctcta tgctgtctaa 22020 
gaagagggca agggcagtga gggccatagc aaggaaggga gacttgagct gaatctcaaa 22080 
ggagaagaag gattgggatg gggtggcaga ctccaacctg ggtccagggg gatgtgctca 22140 
gctgtctgat ggctggaagc agagacccat gtgccatgag gagtgaagat gtccattagg 22200 
cccaggcgag aggtggagca tgcgaagtgg gggtcagggc tgggggcaga agaaaggcat 22260 
agaggaacca ctggccagaa ggttgcagag gccatagggg cagccttttt cctccaactc 22320 
ccattctatt ctctttttag cctttatttt ccgcaccatt cccctcctct actctctttc 22380 
cctttcttgg ggtctctttg tcaccctcct cccttttttc tgagtctcag ttattttatc 22440 
tgcaagataa atgtgctcat cctggaagct ggtgatgttg cctctcctgc ttttcattcc 22500 
aattctgaga tctcattcag caagaactgc tgtcttaggg ctgctctccc aggactgggc 22560 
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cctgttggct cctgtggata tacatccacc agcagaaaag cctgagggtc caggccttgg 2262 0 
gatctgtgcc actgcttgct gggggttgcg ggacaagcct gttttgcttc tcactcaggc 22680 
tgaagacagg tgaggatgcc aagtccagaa attgctttgc ttccagcatc aaatggcttc 2274 0 
ctctcagcag cacagtccct ttaagatggc ggggggcggg gaagctagaa gaagaccttt 22800 
gatgttgttc aactgagaaa tccatcaggt gggaacaagc ccggaaatgc cagagtgaag 22860 
ggctccgtgg ttggctgcat tggtgggtct actgcctgga ctttgttttc tgatggtaaa 22920 
cgttccctga cataaacaca ggcagagcag ggaataacaa cattcagtcc ccaaacaaac 22980 
aaaacaggaa tgggacctgt cttgcacggg gaatggctct tgttgcaatg atatagctca 23040 
agaggccttc actgtggaat ttcctgcctt gagcatgatg aattttccat tcttgtcctc 23100 
gaatgaacag tcgccataga gcttagtggc tggcctgtga agctcagcaa ggccctctgg 23160 
gaaatgggca gcttcgcctt ggctgagcct ggtcagaggg gcatgcctac ctttccttga 23220 
atcagttgct actgtacgcc ttaaagtgcc atgagaacca tcatcctcaa atgcttaaag 23280 
actgagctta aagccatctt ccttttctac ctcatcttta ttattattat tattattatt 2334 0 
gttattatta ttattttgag acagagtttc attccatcac ccaggctgga gtgcagtggt 23400 
gtgatcttgg ctcactgcaa cctctgcttc ccaagttcaa gtgattctcc tgcctctgcc 23460 
tcccgagtag ctgggattac aggcatgctc caccacgcct ggctaatttt tgtaatttga 23520 
gtagagatgg ggtttcatca tgtttccagg ctggtctcaa actcctgatc tcaagggatc 23580 
cactcacctc aacctcccaa aatgctggga ttacaggtgt cagccaccgc accacctatt 2364 0 
tcctacccca tcttatttga ctccctttca tgcaccagac actccagcca tatcagatgc 23700 
atacccgctt tccacgagta ggccattttc ttgcattcct gactttgctt attctattcc 23760 
tattgcttgg accactctgc ccacatcccc atttctgccc atccaagagg cagtatgcac 23820 
agtggtgatg agcatggaat ctgtctaggt ttgattctgg gctccattgc ttaccatcca 23880 
tgtggccttg ggcaagtcat ttaacctctc tgtgtttcta tttccatgtc tgttaaaagg 23 940 
gtgtgataat ggcacttact ttataggatt gtggtaagac ctaagtgagt tactatttgc 24 000 
aaggcactta gaagagtatg tggtacatag taagcattct gccaagcatt aggaattaat 24060 
attgaaatcc taccctcttt caaggttcag ctcaaatgcc acctccccag tgaatctctc 24120 
ctgatccacc ccagtggaaa tgttctctcc ctcctctaac tttccatagc cttttaacca 24180 
gatctctcct ggggagctac atgcctggta tcataacaat gtatgcacat gtatctgcta 24240 
cccgagggtt gggactcttc cataacttcc aaagggcctt acacccagta gttgctcaac 24300 

21 



BNSDOCID: <WO 991 1799A2_L> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11799 PCT/US98/18426 

agatcctggg agaaggattg aatggagttg aaagggcgat aacacgaccc aggaggagag 24360 
cacagctgct tgttaagcct ggccggcctc cccaccagaa agtgagggac ttgactcttc 24420 
gccagcgagt tatgtgtgtc acagttggta ctggctcata tagatctgtc actcctccct 24480 
tatcaggcct tcctggtgag cactggcaag gcttggggca aagtcaacag ggaacacagt 24 540 
gaagggatat gccaagcagc aatgttgtcc cctgatggta accatgaggc atcctggacc 24600 
agatctgggc tgctttatgc ccacctgaat cccaggattc tggctctaca gtagagccag 24660 
atcaatcttg aactaagagc caaagcctac acattccctt ccctcaaaat gcaacccgct 24720 
ggtccccttc tcatatgact tacccagttg ctgctgaact tgggcctcca gatgctgccc 24780 
gcttgctttc ccttctttcc ttcctggcca aagattctag acatagccag cactgccttg 24 840 
agtcagatcc atcatccatt gaggcaggac tcagcacccc cacatacctt taactactct 24 900 
attatctgat ttctgttctt tgctggtgct acttcttgct gaaatgactt ctctgtgtgc 24960 
tgcaagtgac catgactttt ttgtatacat catagcacat agcatggagg tagatgttgg 25020 
ctttgcctgt agtagaagct tccttaattc tttttggttg aatcaagggt tgggagatgg 25080 
gaaataaagc tatagtcatt ctggttatat ttttcttaac ttttacgatt ttgatcatgc 25140 
gagtggatct ctgagtcatc agggccctct atgccccttg agatctaaat gacattgctt 25200 
tgagccccct tctacccctg tgccatgaac agtgcaagtc aagaccccag cccaagatat 25260 
ttttgaattt ctaagtgagt tactatttgc aaggcactta gaagagtatc tggtacatag 25320 
taagcactct gccaagcatt aggaattaat attgaaatcc ttgaatgaac agttgcctta 25380 
gagcttagtg tttggccagt gaagctcagc aaggccctct gggaaatggg cagcattgtc 25440 
tgggctgggt ctggtcagag gggcatggct actttttctt gatccaggcc tggggtgagt 25500 
gttgctactg caccccttaa agtgctgtga gacccattgt cctccagtgc ttaaagactg 25560 
agcttgaagt caccttcctt ttcccactct aagccacctt cccttagagt ggaagaagag 25620 
ggaagagtta tatcctcatg gccttcttac cacccacgtt ctgcagctct gagcttggtc 25680 
tctgctttca cactgttgac caaagcacca gacctagacc ctttctcagt tcccaccaag 25740 
aaaatgtgag tagtggaatc tactaaagat ctttggtggg atgaaatcct gggaggtgga 25800 
tgtggtcacc tcacacagtg gacagccttc ccacacctct ctggctcctc tgtccccttt 25860 
tctccctgct cctctctttc cctccagggt ttccgaagtt gctccaaatt cttcccctgc 25920 
cccctgaggc cttggcacct ataaagggta ttaaatcagg gatgggtatc actttctact 25980 
cttcagagct tatctaagta ttttttacca acataggtag tcttgccctg tgcattctgg 2604 0 
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tgccagccct taagcaatat atgagcaagg cccttcctgt ctctggacct tagtttcatg 26100 
atctgtaaaa tggactggaa tcactagacc agccctgtcc aatagataat aatgcaatat 26160 
gtaattttaa attttctagt agccacattg aaaaaaataa actgaagaac ttagttattt 26220 
ttatccttta gagtagccac ggggaagcca ctgaggaatt tttaaacagt gttattgaga 26280 
tacaatttac atataataag gtgttcatat ctaaagtgta caatttgaca cgtttcgata 26340 
tatgtagacc tccctgaaac catcaccaca atcaagaggg tgaaccacat gcaacattcc 26400 
cggaagattc ctcgtgtccc tttgtaatta tgtatattta atagtatatt tttaatttaa 26460 
tttggtatat ccaaaatatt atcattttaa catgtaatca atataaaata ttagtgagat 26520 
cttttacatt cttttttttt tttcatagaa agtcttcaga ttccattgtg tgttttaccc 26580 
ttggagaaca ttctgttttg tggtagccac gtttcaagtg ctccatagcc acatgtggct 2664 0 
catggtgacg gttctggaca gcaaggtcca tgtgatctgt aagcaccttt ccctctctca 26700 
tgttgaaggc ctccatgtgt ctatatttcc tgacgtgtgt tcttatcatt gattactact 26760 
gctgttgctg ctgtgtgcac agcccaggag gtggccttgc tgcctgccat ctggtgggga 26820 
cccatagtcc ccaccacccc acctcggctg gggcaattgc aggaaaacca cttgttggaa 26880 
accctcttat accatcgaat tccagagtag gctctggatg gggccatctc tgttaacaac 26940 
agagttgagg tagatcaatt gtaaggtgtg ttactaataa aaagtatcaa agtttgcaaa 27000 
gaagtgactt tgcatcataa aagaggttag attcaagatt attcttatta caaataaggg 27060 
aggcgttatc cctagaataa agtcactttc cctctgaccc atattcttaa atgggagaac 27120 
aaagggggca gaggatgcta ttgcttctat gctgggaacc ttagcctgtc ttccctgtgc 27180 
tzaaaatcttg agcgacgtga aggttcacta agggagtggc cat gat tact acaaatttgg 27240 
aggacaggtt attatcacaa cctatgtcaa tgggagcatg cttagagggc gctgcactgc 27300 
aaaaaatatg gaggaaacat gtaacagggg actaaggtga gacagattct ccagacagag 27360 
aggcctcaat ttgcctgggg cctggtgcta ttcgctgaag cacctataga agacttggag 27420 
gctaagcaat gcccagttag cccgccctgg tgtgggccca gtgaaacagc aagagggtaa 27480 
gtacaccttc caggcccaga tgccctcagg accagcccta cttttggcaa aggaagcata 27540 
agcctgggtt caggcaggga aagagctaca gacagttgtc tttggctggc acagtagttc 27600 
acatctataa tctcaccaat ttggaaggtt gagtcaggaa gattacttga gaccaggagt 27660 
tcaagaccag cctgggcaac atagaaagac ccccatctct acaaaaaata aatcagccgg 27720 
gtgtggtgac accggcctgt agtcccagct actcaggagg ctgaggcagg aggattgctt 27780 
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gaaccaggag gtcgaggctg cagtgagcca tgatcgcact actgcattcc attctgggtg 27840 
acagagagag accctgtctg taaaaacaaa aacagaaaaa aagacaatgg ccctaatctc 27900 
cccctgcttt ccctaggcct aaaaggtacc ctgcctttta ggcagggtga accgggtggg 27960 
ggctgcctgc cttcctcaga gagagggaag gaagcaagga gtagggggtc tgactggggc 28020 
cctacagctc ccaacctcaa gacctgcctt ttgaggccat agagaagctg cagctttgct 28080 
cttgcagctg cagcggcagt gaagaaagca ggagggcatc ctgaggcggg aaatgcctgt 2814 0 
cagtgtcatc cccagctgct tccggcttcc tccttcaagg attccagggg ctcctctgta 28200 
accctgccaa cccctctccc ctgccccagg ttctggcagg cagctgtgcg ccccccctca 28260 
gggcccacgt cacaagtcct cagaggggct gtcaactccc cattgttctc ggggcttctg 28320 
gggctcctcc ggcattcctt tggctcatga ggggaaatgc ctgaagcttc gtcttcacct 28380 
cttcagacgc ttgacctaat agtcacccgg ccctcctggc ccctcaaggg atgctgtggt 28440 
gcgcgggaga atctggctag gctagcacta caagtacact tacctcaggt caaagatgaa 28500 
acctgagggg gcacctaggc ctctgtgtct tgaactcacc ttgaaggccc cagccaagta 28560 
ttcccccctc tccctcctga gtattgcaca cagctacccg cagagttaca taaacacagc 28620 
catacccctc cattccaaat tctacccatt ccacacagtt ctgtacatgc tctcacatgc 28680 
acactcccac ccaacccatc tttccagggg agccatcagg tgtgtcaggg ccagaagcta 28740 
cttttgccgt gggtacagga ctgacattct tccaggaagc ctcccctgac tgacaggcaa 28800 
aacttcctgg gtagaagctg accccagtcc ccatctacta tttaaaatat ggtgggtagg 28860 
gaaggcttta ccggagaagg tgatgtttaa agcaaatact aaaggatgtg aaggagtaaa 28 920 
caatgccgat attcaaaaga aggacattca gggccaaggg agaaacaaat gcaaaggccc 28980 
tgaggtagga atgtgcctag tgtgaataaa gaataacaag gaggccagtg tggggtggag 29040 
ggctcgaagg aggaggagta ggaggtgagg ctggggaagt gatggagacc agatcctgtg 29100 
gggccccatt tgcctttgta ataagagaga tgggagggag ttaccagagg gttctgagca 29160 
caggtctgat atgatctgac ttagattgta agtgattatt ccagatgttg tgttgagaat 29220 
agactgtagg gggacaaggg tggaaacagg gagattggtt gcaataattt cctgatctct 29280 
ggcaataatc catgtgaaag gtgatggtgt cttaggccgt ggtggtaatg gtggacgtgg 29340 
tgatacgtga gcagaatata gatatattct gaaggtagat ggatttgtca atggattaga 29400 
tggaggcggg atgtaaagag aacaatcaaa gatgactcaa aatgtttctg cctgacaaga 29460 
gaaaggcata aaagtcatcc aaataaaaaa gaggacatcg ggccaggcgc aatggctcac 29520 
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gcttgtaatc ccagcacttt gggaggccaa ggtgggtgga tcacgaggtc aggagataga 29580 

gaccatcctg gctaacacga tgaaaccccg tctccactaa aaaatacaaa aaaattagcc 29640 

agtcgtggtg gtgggcacct gtagtcccag ctactctgga ggctgaggca ggagaatggc 2 9700 

gggaacccag gaggcggagc ttgcagtgag ctgagatcac gccactgcac tccagcctgg 29760 

ctgacagagt gagactctgt ctcaaaacaa aaaaagaaca tcgaattatc tctattcatt 29820 

gacaatatga ctctacacct agaaaattct aaagatttca cctaaagact gctagatctg 29880 

ataaacaacc tcagtaaagt tttaggatac aaaccaatgt acaaaaatca gtaacatttc 29 940 

tatatgtcaa taacattcaa gctgagaacc aaatggagaa cacaacccca tttataatag 30000 

ccaaaaaaag aataaaatac ctaggaacac agctaaccaa agaggtgaaa ggtgtctaca 30060 

aggagaacta caaaacactg ctcaatgaaa tcagagaaga cacaaacaaa tgaaaaaaac 30120 

attccatgct gatggattga aagaaccaat attgtgaaaa cgaccatact gcccaaagca 30180 

atctacaaat tcaatgcaat tcctatcaaa atgccaacat catttttcaa ataattagaa 30240 

aaaataatct taaaattcat atgtgtatta gcctgttctc actctgctaa tgaagacata 3 0300 

cccaaaactg ggtaatttat aaagaaaaag aggtttaatg gacttacagt tccacatgac 30360 

tggggaggcc tcacagtcat ggtcgaaggt gaaggaggag caaaggaatg tttcacatgg 30420 

tggcaggcaa gagagagcgt gtgtagggaa actcaccgtt ataaaaccat cagatattat 30480 

aaaacttact atcatgagaa cagcatggga aaaacccgcc cccatgatta aattacctcc 3054 0 

caccaggtcc ctcccatgac atgtggggat tatgggaact aaaattgaag atgagatttg 30600 

ggtgggaaca cagccaaacc gtatcaatat ggaatcaaaa aacggccaga ataaccaaag 3066 0 

caatcctggg caaaaagaac aaagccagag gcgtcacatt acctgacatc aaattatact 30720 

acaagggcac agtaaccaaa agagcatggt gctggtacaa aaataaatac agacacatag 30780 

accaatggaa cagaatagag acccctgaaa taaaactgta cacctacaac caactgatct 30840 

tcagcaaagt ggacaaaaat aaacaatggg gaatacccaa taggacaccc tactcaataa 30900 

atgctgctgg gaaaactggc taaccatatg cagaagaatg aaattcaacc tctacctgtt 30960 

accgcataca aaaattaaca caaggtggaa taaagactta aatgtaagac cataaactat 31020 

aaaaatccta gaatcaaact taagaaatac tcttctggcc attggccaag gtaaataatt 31080 

tataactaag tcctcaaaag caaatgcaac aaaaccaaaa attgacaagt aggacctgat 3114 0 

taaactaaac agcttctaca catcaaaaga agccatcaac agagtaaaca gacaacccac 31200 
agaatgggag aaaatagagg caaactatgc aaacaacaaa ggactaatta atatctggaa 31260 
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tctattagga acttaaactc atcaacaagg aaaaaaacaa acaaccccat taaaaactag 31320 
acaaaggaca taaacagaca cttctcaaaa gaagacatat gagtggccaa caaacatatg 31380 
aaaaaatgct caacatcgct aatcagagaa atgcaaatca aaaccacaat gagataccat 31440 
ctcacaccag tcagactggt tcctattaaa aagtcaaaaa taacagatgt tggcgaggtt 31500 
gcagagagaa aaaaaacact catacactgt tggtgggatt gtaaattagc tcagctcctg 315S0 
tagaaagaag tttggagatt tctcaaataa ctgaaaatag aattacaact tgacccagca 31620 
atcccattac tgggtatgta tccaaaagaa aataaatcgt tctaccaaaa agacacatgc 31680 
actcgcatgt tcatcgcagc actgttcaca acagcaaaga catggaatca catcagcctt 31740 
ggtgcccatc agtgtgactg gacaaagaaa atgtggtata tgtacaccat agagtactat 31800 
gtagccataa aataaatgaa attgtgtcct ttgcagcaac atggatgcag ctagaggcca 31860 
ttatcctgag tgaattaaca caaacagaaa accgaatacc acatgttctc acttataaac 31920 
agcagctaaa tattgggtaa acacagatat aaggatggga gcaatagaca ctggggactc 31980 
caaaaggtgg gagggaagga ggagggcaag ggttgaaaaa ctacctacca ggcgctatgt 32040 
tcaccgtctg ggtgatggaa tcaatagaag cccaaacttc agcatcacac aatatatcca 32100 
tgtaacaaat ctgcacatat atcccctgaa tctaatgttt tctaaaaaaa ggtttctgcc 32160 
tcagccacat gggaattgtc atgtactgcc atttacttca aagggaccag gtatgcagga 32220 
ggagaggatc aggcaatcag cgctggtcac attatgtttg agaagcctat tggacatcca 32280 
aagggagatg atggataggc agtttagtac atgagtctgg agctcaaggg agatgcctat 32340 
aactttccta ttttgttgtt gttattgttt gtttgtttgt tttgttttga gacagagtct 32400 
cgctctgttg ccagactgga gtgcagtacc acaatctcag ctcactgcaa tctccgcctc 32460 
ctgggtaagt gattctccag cctcagcctc ccgagtagct gggactacag gggtgcgcca 32520 
ccacacccag ctaatttttg taattttctt agaggtgggg tttcaccatg ttggccagga 32580 
tggtctcgat ctcttgacct cgtgatctgc ccacctcggc ctcccaaagt gctgggatta 32640 
caggcgtgag ccaccacgcc cagctcctgt gtgtttttta tacttgtcca gtgcattgta 32700 
tctataaaca attcacagga ttgctttatg agtgttcgaa ctttccatca atgacttcct 32760 
gtacgtatca ttttgcaact cgctgttttt gcccaacatc aagtttttga gatttttcac 32820 
attgaaacat ctagctcttt ccatttaact ccatagtatt ccatcatata aatatgtccc 32880 
aatttacgta cttactgccc tattatcagg ataatcgggg ttgtttggac ttccgaatgc 32940 
ttctttatac tccacagagt gtgatataac gctttgtggt gctgggcaca cagatagtaa 33000 
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atgattgagt atttctgttc acctgcatct gtgtgcctgg gatcattaat tcccaggcaa 33060 
cagggaacac atcatatctg ttgatttcac aaggacttat ttactaagct aggaccagct 33120 
agatttctcc ccggtgatat ttggttagct gtgttgtcca aggtgctgga aagtaacagc 33180 
aatagggcag gtgccatggc tcacacctgt aattccagta ctttgggaag ctgaggtggg 3324 0 
cggatcacat gaggtcagga gttcaagacc agcccggcca tggcaaaacc ccatctctgc 33300 
taaaaataca aaaattagcc aggtgtggtg gcgcacacct gtagtcccag ctactcggga 33360 
ggctgaggtg gaagaatcgt ttgatcccag gaggcggagg ttacagtgag ctgagattac 33420 
atcactgcac tccagcctgg gctacacagc gagactctgt caaaaaaaaa aaaagaaaga 33480 
aagaaagaaa gaaacagcaa tggtttttac tttgctaatt atcgcagacc ttagtggctt 33540 
aaaacagtga gtttattatt tcacagtttc tgtgggtcag ggatctgagt acagcttagc 33600 
cggaccctcc gcctcgggcc ccctcacaac ctgcagtcaa ggtgttggcc agggctgcag 33660 
tcacctcaat gctcaacagg ggaagaatcc atttccaagc tcactcacat ggctgtcaac 33720 
aggagccctc aggtgctcac tggctatcag accagagcca cagtttcttg ctggctattg 33 780 
gctggctcag ttccttgcaa cacagaactc tccagaggac agttcacaac atggcagctg 33 840 
gcttcctcag agcaagcgag tgagagatgg tgagtgtgtg agaaaggcag gagtcagtct 33 900 
cacaatttaa tcttggaaat ggcagcccat catctttgcc atattctatt tgttagaagc 33960 
gagtcactta gtccagccca tatgcaaggg gacggggtta gacaaaggca ggaataccag 34 020 
gatgggggat tactggggac tatcagagaa gcttccaagc atggaaacca aaggactgag 34 080 
gtgggggcag gattgattgg ctgaaatgcc tgcagagacc ttgaggagga tgaagagtga 3414 0 
ggaggggcat agagtgaaag ccagagagca ggagtaggag gatggaaaac gtctggtgta 34200 
gaccatttgt ctcagaactt tggtggggaa ggtaaagtgt aagaaatgtc agtggaggga 34260 
caggggatct ctgagcacat ttgagggcca aggggaagaa gccaatgcaa atggtggagg 34320 
ggaaggggaa atgtggacgc caggctccta ggagaagagg aagggcatct tgtcacctac 34380 
ccctgaacat ttctacagtg ctctgtaggt aaaaaagtac ttttccataa acagtttacc 34440 
cttgaacaat gtgggtttga ggtgcactgg tccacttatt tgtggatttc ttttttcaac 34500 
caaacacaca ctgaaaatag agtattcgca tattgtgaaa cctgtgtaca tggaggggtg 34 560 
actttttgtg tatgtgtctc aaagggctga ctgtgggact tgagtataca tggatcttgg 34620 
tatatgcggg gtcctggagc taaacccccg tggataccaa aggatgacta tacaggcttg 34 680 
cttgagcctc ctatcaatcc agtgaggtta gcagaacagg gatcgctttg tccctatttt 34740 
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agatgaagaa acagatgcag agaggtgaca tggtttttcc taggtcatat aggcaataaa 34800 
cggccaagcc aggactagaa ttcaggtatc caatctctaa gtgtttctag tctagtgctg 34 860 
gcttctcatt aggcctcagc actttcagca tgctgctcct ctgctcaagt cctctccctt 34 920 
gtgtctgtaa ggtcgagaac tctcttcccc cacagggaac tgaatgagtt tccagttcaa 34 980 
gcaggccagg caatggctgg aaatgagtga gaattccttg actatcaatc aaggggctgt 3504 0 
aacccaaatt aaggttgaag ttgtgagagt ggtgtgggga agcccctgca tagtgggctg 35100 
gtaagatgca cgccacgggt cagtgggact ctaggcccag tgagggagtg ttgtggggag 35160 
catccacgtg ggccaagtaa gggccggcat ggaagacagg cgggctggga gcctggctct 35220 
cactgcgaat gagctgtccc cattatttcc tcttccagcc actcagtccc gctggctctc 35280 
caagcagctg acaagcccag taaaagagtt tccatccagg cctggaaggc cccagtgaga 35340 
gctggggagc tgcttttatc cagccttggc aaggcctttc agctggagtc cagcccactt 35400 
tgggcaacat ttggaaactt actgaggaaa tggaggtctc ctttttagga atcctgcccc 35460 
acccctgctc ccaccatggg ccttagtttt tccccaggaa acctggagca ccttgagact 35520 
ggagactgtg tctaccccag caggcaggaa accaggggct caaacttcca agccctgccc 35580 
tttgtgagct cctaggatac acacaattca ttctgaccct gggacaagga gccagtgaac 35640 
cagtcatcta taacaacctg agctacacag ggccccagag aaaccttcca gaccctcatt 35700 
taccagctaa agaaattgag gtacagaatg gggagggatt tggtccagga tcacagagcc 35760 
actgagtggc agaatgggag ggccatagtt tggaggagac tcgatgctca agaaaaaggg 35820 
actgtcaaac agttaaggtza ggagttcctc taagcagagt ttagagaatc gatccaagaa 35880 
acacttgaag cgtgcgagtg gcatagacta ggtgctcaac aaacgcagca taaataaata 35940 
aatgaaactg attgttgctg caggggagct gggacggact aggccacgag aataaaaaac 36000 
tctgccctgg tctgggtcct gctgggtcca gatcagtgac actagaatag ccagcctggg 36060 
cttttttccc atgggaaagt aggacaactc tgaccagaaa ctggggtaat ggcaatgagc 36120 
ggagggagta gattaaaggg agagatgccc cacccttact tgggataaag agctcccctc 36180 
ttgcttcccc cttcattttt actgattgct tacttggtgc caagcattat cagaaacaca 36240 
cacacacaca cacacacaca cacacacaca cacacacaca cacatagcat taacacacat 36300 
ggtccctatg aggcaggcac tatcattgcc tccatcttat agataaggaa actggccccg 36360 
agaagttcag tgccttgccc ttagtctccc agctgataag aagtaaagct gggatctgaa 36420 
cccaggccat ctgacttcaa tacctatgct tctaaccact aggttatact gccatttcgc 36480 
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tgagcaaggc agcttgtagt ggagctgatg cccctgcgga ttgaaggccc actcattcac 36540 
acactcacag ccacccctgc ttggaggtgg catagcaggg aacaagttgg agttgccaga 36600 
ggccccaagg tctccaccct acctctatct tggagcttaa agtatctcgc tcctggcctc 36660 
ttttaggaat ggtagggcca ttagcccaat gaaggctttg ccatgagttt ctcagacctt 36720 
tgcagggata gcagggccct aaggagactg aaccagggtg gaaaaatctg aataggagtg 36780 
gaagaggagg gtacagggca gcccattatt atctctggtc atttaacact tcttaaacac 3684 0 
ctactatgtg cctgtacagg gggctggaac tacagcaatg gaccagagac ctgtccctgc 36 900 
aattagagag ctcacagtct agagggggag agcaactagc aaacagacta cagagaggga 36 960 
aactgaggcc cagagaggtc aaaggccttg cccaggaata ctcagtgggt gagtcacaga 37020 
gctgggccca gagtcagtgc tcagactccc aatccagcat gatccctgct acagcaaaag 37080 
caagcaaatg caaacaagct cagatgaaag aggggtgtta tgaactaagg agaggcatga 37140 
ggcaggcagg gttaggcaga agggctatct tactggaggt ggagttcaaa ggagggtgct 37200 
acagtctgaa tgtgtccccc aaaattcatg tgtcgaaacg taaagcgaat atgacagtat 37260 
taagaggcag ggactttcgg aggtgagtca tgagggtgga gcattcatgg atgggattag 37320 
ggcccttata aaagggcttg aaggagttgg tttgttggct tctcctcttc tgccatgcga 37380 
ggacacagca ttccttccct tttgctcttc tgtcctttcc accacctgag gacacaagtg 3744 0 
ttcctccctt ccagaggatg cagcaacaag gtgccatctt ggatgcagag agctgccctc 37500 
aacagacaac tgaacctgcc agcagcttga tcttggactt cccggcctgc agaactgtga 37560 
gaaagagatt tccagttata agtgacttag tctcaggtat gttgttatag cagcacaaac 37620 
agactaagac aggggaagga tgccctggtg gaggggtgcg actggagaaa gctcagtgct 37680 
cagtgagccc cagtgaagtc ccaggaaagc cactggcagc agcccacacg cttgggcggt 37740 
tcctggacag ccaaaggagg aaccacaccc accaggccca ccagccttgg acgtggccca 37800 
gcaaggccaa gcgtttagta attggcgaag tgactgtctg ctgccgcccc agtccaaatg 37860 
aataattatg gaatgtttac atgatgtggt gttcctgaca ccttctcaag tttccatggc 37920 
aactaggaac caccgcccca cccaccccca acccaaccca caagatgttc tgtctgttgt 37980 
ggccagcagc tgtgtcatgt tgtgaataaa taacacatgc ctggccttgg agactggagt 38 04 0 
taccctacag tcatccctga cagggcagac ttagtgaggg gatggggctg cagcaggcag 38100 
taaggcgtct ctaatgcttt gaatgcattc acctgcctca gcccaatgtc accagagata 38160 
ggccatgggt cagcgttcat cccagtggtc cctcctggag ctagtaagga gacgtccctg 38220 
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ctaaagagat gttcaatgtt taatacaccc aagcttcagt ttactcccct tgagcttttg 38280 
ttttctcatc tgttgaagat acccatgcct gtctcaataa ctttttataa agaagagata 38340 
agatgacaac cagggccagg cactgtggct catgcctaca atcccagcac attgggaggc 384 00 
tgaggcgggc agatcacctg aggtcaggag ttcgagacca gcctgggcaa catggtgaaa 38460 
ccccgtctct actaaaaata aaaaaattag ccaggcgtga tggcacacaa gagaatctct 38520 
tgaatccagg aggtggaggt tgcagtgaga tcgcgccatt gtaccccagc ctgggcaaca 38580 
gagtgaaacc ctgtctcaaa aaaaaaaaaa aaaagaaaag aaaaaagaaa aagatgacaa 38640 
ccaaccagca tagaaagcat atcctaccat aatggctagc acttagtagg tgcttaaaaa 38700 
tgtcacatct ggaaggttcg ttctaaacat ttgtgcctgc ccccacaacg gaaacttctc 38760 
tcctccccag atccccagat ccatgaataa gtggaagggc agaagctcca gtggtcttct 38820 
ggaagtcaga gaacccaagt tagaggtcaa taacgtcctg gtggattggg agtaacccta 38880 
caaacgtgag actcttgttt tagctctgac tgtcctaatg gctggcgatg gggccttggg 38 940 
caagtcactt gccctttctg gttctcagcc tctctgtctg gactatgaga tggagtggcc 39000 
tagaccagag gtttttcaaa ttgtgttcca cagaagcttg ggtgcctcag ggaactctgt 39060 
ggtagggaga gggatgagac tgaatgggtg gggctctggg cccctgagcc tgctttgatg 39120 
gaatttatat gtgccaaggc tttaaaaaaa cagtttgagg ccaggcacag tggctcacgc 39180 
ctgtaatccc aacactttgg gaggccgagt caggtggatc acctgaggtc aggagtttga 39240 
gacgggcctg gccaacatgg caaaactccg tctctactaa aaatacaaaa ataaaaataa 39300 
aaataaaaat aactgggcgt ggtggcaggc acttgtaatc ccagctacta ctcaggaggc 39360 
tgaggcagga gaatcacctg aacctgggag gcggaggttg cagtgagccg aggtcgtgcc 39420 
attgcactcc agcctgggca agagagcgag actccatctc aaaaaaaaaa aaaaaaaaag 39480 
tttgaaaact ctgaccgcat gcttttggct ttgataggga aagagagctc cttggccact 39540 
tcctcaatcc cagagtgcat gagttccctg gcctgggagc agatacacca agagaggcac 39600 
tggatcccaa aggtacccca aagctccatt ttctcccaca ccacccatcc tgagctgtct 39660 
ttctgtccca gaactcctct ctgacagttc caactgccca aaaccaggct acttccacca 39720 
gatgccctga ggaacagtat ggcaaggggc cctgggcagg gaagaggctg aggaggagga 39780 
acaggccctg gccacctctt tcctttagga gggtacgaca gttcccagtg gcccaacctc 39840 
ccagggcgtt ctcccagagt gtggagggcg ggggcctaat ctgcatttgg gacaaagcct 39900 
acccggtgct cgagtgttgt tgtcatggaa actcaaaagg tgttattagc atcttttcaa 39960 
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ctcagtggcc ctgtgtggtc cagtccatgc aatcttagca gtagcacacc atcaccctgg 40020 
caacccatta gcctagtgtt gttaccatgg aaactccagt aaggtcgatc agtccgggaa 40080 
gttcccagag gagataggca ctgggagatc tgcccagcag tgcctactct gagggtctcc 40140 
cagctgacag tgacctgctc tgcatacact gacagtagtt taggatcttt tccggagcga 4 0200 
gctaacagtc catgtgacag gaaagaattt ggccctgcag actggcaggt ctatgcctcg 40260 
ttaccatgga aatgctgctt gtcaggatct caaagcacag tgcaaaacta actcttccag 40320 
gagactgagg ctggcaggaa aggaggctgt gagcctgttc ttcattcaca tgcccattca 40380 
ttcattcatt cgttcattca gtatttagtg cacgctcatt aatacagtgg gcactgcgta 4 0440 
tgtccgaggc tctaactgga gcaaacaggt gggctcacca actctcttca attgaaaagt 40500 
atactgaggt ggaggggacc atcttacttc ctgtgtcaac ctccactgga ggaaaaggag 40560 
ttgacatgcc caaaggaatg gaatcggggg gtctctgtgt gtcgaggggt acagggagaa 40620 
tacttgcttt ctgtagcctc ctgtttttca cctatcccct gcttaagcct tcaggaaaca 40680 
gatctttgtg tggtctggat caccaaaggc ttggccagac atgctagaag tttccagcag 40740 
ccacagctaa gtgaagttag aacaggctac gttgggaagc aatgagctcc ccatcactgg 40800 
agatgtacaa gcagaggcaa aaaatcctgc tcagtgaggg ccatgtagag cccgcagagc 40860 
acaatgattc agagctcaga ttctggagcc agatagccag tcttcaaatc ttggctcagt 40920 
ttaccagctg tgtgactttg ggcaattcac ttaatcatac tgagccttga tttcctcatt 40980 
tgtaaatggg aatgataaga gtttcttcct catgggcttg ttgtggggat gaaatgcgta 41040 
taatatataa agcacttaga acagcaccta acatataatt gctcaagaaa tggtagcttg 41100 
cattatccct tgatggccca actaaagtcc ctgacaagct tgggattcta tatgactatg 41160 
ggtcactaat cttcctgcac acggaaaatc agaaagggac tctggcaagg gtaagattct 41220 
gtgctgacca gagagctggc tgagggtagc acctctgagc tttgagctca tcaccacttg 41280 
cgtgcttcca gcaatagttt tcctttccaa gtacttttgg atccatgtct caattgggcc 41340 
taaagacaac agcaaagggc acagttacca tcccctgtga tagataatgg gaagtaaagc 41400 
taagagcggg agagtggctc acctaccaag tggcacaact gggatggaag ctttctctct 41460 
gtcacaccac ctcacagtcc ccagggccgc cataagggga ggacagcagt tgggaaggag 41520 
caagtcacgg gctgggcgga ggaggagccc agcatgacaa taacctatga aaatcagctt 41580 
tccactgtct gcacttctgc aaggaggagt tgaggccaaa gccagggagc agaagggggt 41640 
atttcctaga aatgcggcgg gagtggctgg tgtgctccac ccatgggcaa aggacaaggc 41700 
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tcggcctgcg tgtgagcact cacttggagc tgaacttgga gcctccagag ctcccatccc 41760 
tctgagctca ctcagaaagc tttccctttg ggcaaagtcc aagggggtca ttcaactgtg 41820 
atctgacact tctgctttca acttaatagc tcaagtattt aacggatggc tcatggcctc 41880 
accaagggat tttgcttgcc gagactagca aggtgggcta gtgttttcat gccatttgga 41940 
ggagcaatga gaaaggattg tttaaaaaaa aaaaaaactt tgcctgagaa caagagtaac 42000 
gccctaactc atggcctcat gaagctccag gttaaaatgc acacgttacc tgggagtatg 42 060 
ggaattggac aggggcaata gtgaaaaatc tttaatagcc agcaaggtct ggacaccaac 42120 
tgattagaat gtttgctggc cagaaacacg ctatagggat gcaccagccc agatagagga 42180 
aaacagcagc cctggatgtg ggccttggca ggggccgggt gaagggaggg tttgctccct 4224 0 
gagaggtccc atgagcacac gcaggaatca gatctgccta tggccaccag aaggtatgag 423 00 
atttcccaca gcagctggtg gtgggggggt gggcagggtt ggtgtatctg gtagggggaa 42360 
aactggtggg aacatggggt tcactgaggg aagagctcct tgggacagct cgagggccag 42420 
acagcctagg acagtcctgt aaggatgcaa atggggcctg gcctggcctt ggagacccag 42480 
atgctgccct ctgccctcac agacttcacc cctggtgcct ctgggtggca acttctctcc 42540 
ctctctcagt gaacatgcat acggactata atatgctaca cagcgtggag caaatatgca 42600 
tacccattcg gtgttcacac aagcacatgc atggagccca cgtatgccca tgacctgtag 42660 
aggcatcctc tcaagtcagt ggggcacacc cacaaaattg ttcagactca gaatcttgga 42720 
gcagaagcaa ttttagagct gcccccatta aaaaagtcgt ctttgatgaa cgccagtcag 42780 
ccactcctct acacacactt tggcgtttgc tcgttctcac acctcctgca gaacaggaaa 42840 
gaggagatga cctcgggaac tttcttaaaa tggcctgtgc ctagacttga cgcaagcctg 42900 
aagaagtccc tgcagttcca ggactcagcc agcagggggc agcctctcat gacacactgg 42960 
gtttccccca aagaataagc aggcttggag aagaaagaga ggccgactgc aaagaaaggg 43 020 
aaacatataa gaagtttcca gagtgtccgg ccatcaagct gacccagatg catgtagcca 43 080 
gccactcctg tgggtgagga attctcagcc cctcatattc agctggtgaa ttgatagagt 43140 
ctggggcaag ttatcagaag aggatgtctt agtgaccata gtacatcctg cttgcccagc 43200 
ctagagccac agagggaatc agccagcaca caggagttca gtggaatctc tcattctgca 43260 
agtcacagcc tctctcaggg cctgcttcgt tgtctgtgaa atgggtttaa caataaatca 43320 
catctgcttc ctcccacctt ggtgctttgg acacaagccc tgtgctccct ctgctgagac 43380 
ctcctttctc cagcttaacc cctactcact cctgaggtgt cagctcccac ttcacctcca 4344 0 
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ggaagctttc tctgacccca gatcgtgcac tcagtgataa acttcctaga ggcctgtttt 435 00 
tcgcctccag ggcccttgcc acagttcctg gttagacatt cggttgtgat ttttgaataa 43560 
catctgtttc cctctccagg tattaaggaa tgtggtacaa agtaggccct caataaacat 43620 
tcagcgaatg ctgagaaatg aattcccata gctcttttac agatcaaaaa gcactttcac 43 680 
atgcaaacag ggtggtcttc atggtacccg ctgagaacga tggggttttt tccccgttac 43740 
acacacagca ctctctactg taccaaacat tttaacttca gaaacttcag tcactagaag 43800 
cccgagaggg agccagggat tcttacttcc attttagaaa aggtaaagct gaggcctaga 43860 
aaggtgaagt gactgaccca ggcttggaac ttggaggagt caggatgttt aagtctccgc 43 920 
atcccatgct ggaaggtatg atcattacat tccctcatta tcactgtcag catcgtcacc 43 980 
tccattgtgg ctatggccat gcctagtctg gtgaggaggt gtacagttgg cgagcacaaa 44 040 
ggcttttgaa ggtggcgggc agaagtctgg gttccaattc tggcctcacc tgcaggtgac 44100 
catcggggct aggagagcat ccagggagga ttcatggggg aggggagcct tggctggcag 44160 
ttggaggacc cggagaattg gaggaaggtg gagaacctgg gaagggaagt ctggggaaac 44220 
tagaagagca ctggaaatgg gttttgggtg acttaggctg aggctttccc tgcttctgct 44280 
tcctaaaaga tgggtatgaa gctccctccc tcgcaggtcc tgctgaggcc ggtgcgcagg 44 340 
ctcagggcca ggagaacatc cctggggttg cgggggagtg gtacagaggg cgctgtggct 44400 
acaacagcta atgagggatg tcacaccctc accacacacc tttcgctggt gcctcccaga 4446 0 
ggaaaggtga ggaccaagtc tttctccact gttcctcctc cactgtccct tgcatggtgt 44520 
cctgcacaga gtaggcgttt tttttttttc ttttgctgtt gtttgttttg cttgtttgtt 44580 
tgagatggag tcttgctctg tcgtccaggc tggagtgtag tggtgcaacc tccgcctccc 4464 0 
aggttcaacg gattctcccg cctcagcctc ctgagtagct gggactacag gtgcccacca 44700 
ccacacgcag ctaatttttg tattttgtag agacgggttt ttgccatgtt ggccaggata 44760 
gtctcgaact cctgacctca gatgatccac ccacttcggc ctcccaaact gctgggatta 44820 
caggcatgag ccaccacgcc tggccaaggg taggcatttt ttagctgatt gttttggggt 44880 
aggggtaggc caaagagctc tttataggct tgggttctaa aagtttgggc atagattgag 44 940 
agaaggggag ctccttttga aacttacaca aggcatggga cctaaaactt gacaggcaat 45000 
ttgggagtgg gaatgggggt ggggcaggct gtggagctga gctagagggt agcccagagt 45060 
agagggaagg atgaaagagg ataatcagat cagaaggtct ctgcagtgct ggaggtggaa 45120 
ggggctgtgt ggtgagattc cctcctccat ccccctgcac tcctcactgg gacctttttc 45180 
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ctccctgctt tgtttatcct ctcttatctt tccatatccc tcacatctga ggagttggga 4 524 0 
atttccaaag gagagaggaa ctccaggctt ccagaaagct caggagccta gagacatggg 4 5300 
ggatgaaacc ccccataacg tatgccaaag tcaccagcct ctatccctac acctgctggg 4 5360 
gagggagcca gagatgtctc tgcctagatg gaaaaagagg caccctcttt ccaacaggcc 4 5420 
tctcacagcc cccaaatcac aagcagttct gttgccctgg gactgagggt gccaaagcga 4 5480 
agcccctccc ctacagacga gaggtgactg cagccctgga gggaatgctt ggcatttcag 45540 
cagtgccttt gccttccttt aaagcaatgc caccttgttt ctctcagttc gtttgcacat 45600 
accacagaag atctttagac taatggctta ccttctcagt gactcagttt cctcagctat 45660 
aaaatgggat tgtgatgggg actgactaag tcaggacagg tgaagttcct caaacagtgc 45720 
ttggcacatg agaagcgctc tacatgagtt tgctaggact atctagctat gttccttaag 45780 
cgttcactta tcctttctga gtcccacttt cttcatttgt aaaaataaca gggcttaggg 45840 
tactgaatga tctctcaggc tcatgtcagc tctgcccttc tatgattctg tgtgtttaag 45900 
gtagggaaag cagagattgt catcctatga aaatagatga gggagaacca ggacccctga 45960 
tagatgaaca gacctgcctg gagtcacaca gccagttact gaccaagctg gggttaggat 46020 
ccagagctca gaaccccaag tttcacacca ccaggctagc tctgcagcag tgtgccaatt 46080 
ttttaacatt cagggaagat cagaaagcat ttaatttgtg caaagggaga agtatggaat 46140 
ggaggttcca gcaggccctg gggttatggg aagcttgtgc aacccacgta acccagcagt 4 6200 
ggacttcctc actggcttag gttcccattt tcctgccata tgcctaattc ttctcccaga 46260 
gctcctccct acaggtccct gggtcaggtc ctgaattgcc tctggagggc agcctgaatc 46320 
accccccact gttagctggg accccaacaa atccctagtc acaagcaggt gcttcacatc 46380 
tacccaccca ggtgtcctgg gcaggtgagg aggctgagac ccagaaaggg ccaaggaaag 46440 
actttgctaa gctacccagc aagccctggg tagaagcaga ttcaggactg gaacccacgt 46 500 
ttctgagccc ctggccttgt gcagaaagga accccaatgg ccaagcaggt gtgctggaac 4 6560 
aggtgcctgg ctccacaatt ccagacagat caggtgctaa tggtgagcac ttaacacctt 46620 
tggagaggcc tcctgggctg ctgcccaagc taagcttcta caaggaaaaa tgccgaggac 46680 
gtaatctgtc cctggggagg ccctaaaggt tatctgggag accggaaatt tctagtgact 46740 
gaggcacact catggtggga aagaggaccc ttagcagaag cagggaagct atcatagttt 46800 
gttttctgta gccacagact gggtaactta taaaaaaaat agtttattta gctcacagct 46860 
ctggaggctg ggaagtccaa gggcatggca ctggcgtctg gtgagggact ttttgctaaa 46 920 
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tcatgacgtg gtagaatggc atcacatggc aagagggcaa gagtgtgcca gctcaggtct 46980 
ctcttcctct tcttataaag ccaccagttc catcataggg gcacaaccct gatgacctta 47040 
tttaatttta attacctacc aaaggcacca cctccaaata ccatcaacat atgaatttag 47100 
ggattaaatt tccaaaacat gaagttgaag gatgcattca aaccatagca gcagctgacc 47160 
tgagccgggg acagacccag gtttgaagct ctactcaccc atgagtggct ctatgaattt 47220 
gaacaagtga cctcacctcc atgagcctga tttttcatct ataaattggg gtaatgatgg 47280 
ctgccctgtc cagttctcag ggctagaatg aggcttaaca gagataataa tagctcccac 4734 0 
ttggtaggct tgtaacatgg gccgggcact gtgccaaaca ctttccctgc atcatcgcgg 47400 
tcaatcctga ccacagtgct ataaattggg cagcagcatc ctgcctaaag cagaaacatg 47460 
ggctcatcca gatcaagacc cttgctgcaa ggtcacacag gtgagtaaat ggtagtgcca 47520 
ggatttgaac cctgatctga ctccagactt ccccttccac tggaatcact ttgttagtat 47580 
gattattatc agaggccatc tcataaccag tgtattaaat cacttaattc tacaccttgc 47640 
tgcagaatgt aagtcttccc cttggactcc aggcaagggt tcaggggctg agttgaaggg 47700 
cttgctgatg cttcatttga gacagggctg ggaagccacc tgcttttata aagatttttt 47760 
tggaacacag tcatgtccac tggtgtatgt attgtgtatg gctgcttttg catgtgccaa 47820 
aaactttacc tgcatcatca cagtcaatcc tgatcacagt gctaaaaagt aggcagcagc 47880 
atcaaactca aagaggtttg agttgagata gagaccatat ggctcaaaaa gtcaagaaaa 4794 0 
ctatctggct ctttacagaa aaagtctgtt ggcggctgat ttgaaacatt caaggaagtg 48 000 
tcctggatgc ggcttgggcc tttggctgaa gtccaagaca gcggcaaggg gagtcccttg 48060 
ctgggctcct tctctgagca ctggccactg cataggaaaa gctggatttt caaaggagtc 48120 
ctagagaaga agccttcctt tcaagagtta ccaagaaatg ctggctgagt cagcggccct 48180 
tctcaagcca ggagaggaag atactgagta aatctctgcc atctcctttg gcagttagct 48240 
ggcacaggca cactctggtc agaaagaaac tttggattga gaaatttcag ttgagtaacc 48300 
tttaggcttg gagaatcgaa ttctataagg acttcagaat ggctgctcag gccaggcctc 48360 
atggttccct caatacggag gagcccagcc cctacagggg gtaccaagcc agctaggcct 48420 
acagaagagt ctgaaatgtt agatttctaa gcagttatgg ctcattttac accacacctg 48480 
gggtcctgaa aagctgtgtg tggtctgagt ctgtttgcct ccctaatctc tgtcatttca 48540 
gaggctgcct aattccggat ggatttgaaa gagtgttggt tttcagattc tctgccccca 48600 
ctcgtcaacc tctcagttca ggacctccgg tccagaaaca atcagttaag tgaactccta 48660 
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tttcttttcc taagctctgc attttgaaaa gcaatgccct actctgcttc tgttgtattt 48720 
atttggtcta gatatctcca aagggtagta gcaacgcata tcatatgttc agccttcttt 48780 
tggtttgagg tctaattatc tttacagaag ccaccaactt tcttttcctt atcccctgcc 48840 
ccaaggaggt cacacttgga tccacccaat agtatccagt tgactacgta ggttccatag 48 900 
accctggtgg ctgctccccc ttccttgtcc ccgccgtctc tgcccatcat cctttcacca 48960 
acttaaccaa atgctattgt gccttctagg gcctgttttt atcactcagc ctctattagc 4 9020 
tttccctgat cccaccagtc cacactgtgc cctcttgtac tatgagnttg tattgtactt 4 9080 
actaagaata acaaacatag ctgggcatgg tggctcatgc ttgtaatccc agcactttgg 4 914 0 
gaagccaagg caggcggatt acttcgggtc aggagctcaa gactagcctg gctaacctgg 4 9200 
cgaaaccctg tctctactaa aaataataat aataataaaa ataaataaat aaataaaaat 4 9260 
acaaaaaatt acccaggcat ggtgatgggt gcatgtaatc ccagcttctt gggagggtga 4 9320 
ggcaggagaa acatttgaac ccaggaggtg gaggttgcag tgagccgaga ttgcgccact 49380 
accttccctc cagcctggga gacagagcaa gactgtgtct cggaaaaaaa aaaaaaaaaa 49440 
gaatatcaaa ttgtacctaa ttcttctttc tagttagggg accttctgta gtactgtaga 49500 
aagagttcgc attctggaat ttgaagacca gattctagat tcacttatct aattcactgt 49560 
acccttatgg gtcagtatta agttgtcctg ccccatgccc cactcccttc tccttgcttt 49620 
cacgtgaatt tagaactttg gggatgatta ggcaggtaac agactaatca gcataaagcc 49680 
agtccacctg aaacctctgc ttaaaaccag ttccacctgt ttigcctaaaa gtcaaactag 49740 
taattttaaa ataaggcatt ttgcaccagt tttcctaaga ggcacttcca atgtaacatc 49800 
actttttttt tttttttttt tgagatggag tctggctctg tcacccaggc tggagtacag 49860 
tggtgtgatc tcggctcact gcaaccttca catcctgtgt tcaagcaatt ctcctgcctc 4 9920 
aacctcccga gtagctggga ttacaggcac ctgccaccat acccagctaa tttttgtatt 49980 
ttcagtagag acggagtttc 50000 

<210> 4 
<211> 50000 
<212> DNA 

<213> Homo sapiens 
<400> 4 

accatgttgg ccatgctggt ctcaaactcc tgacctcaag tgatctgcct gcctcggcct 60 
cccaaagtgc tgggattaca ggtgtgagcc accgctccca gccgtaacat tgcattttta 12 0 

36 



BNSDOCID: <WO 9911799A2J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11799 



PCT/US98/18426 



aatcactatc acaacttgca gcattttgtc aataatgaga ttgttttgta tgcttgtggg 18 0 
tgactttttc tatttttctc tcctattgat catttttatg actttttagg gagtttgttt 24 0 
gaagaccctt tctttaaaat tgtatatttc tagcactaaa tacttgtcaa atctatagga 300 
tttcccacga gatttatcaa atgaacccat tttgatccct cctctctgct tctcactgag 360 
ttttgctctg tactgtgtat tcctgtgtgc acctaatggg tttattaacc aaattagcat 42 0 
gtacccaatg ttcccttctg ttctgcaaga tattggcatt ttgacaacaa acgtgaagtg 48 0 
atgagacaga tggggcagaa ttatattgaa ctctttgcat tttatttttc aggctatctc 540 
attcatcaag caactctact ttgtgaaaca taaaatgata caaacaaagt cattatcaaa 600 
tgtatttatt gctgaaaaca taacattttt caagaaaggc aaactggcta aaaagtcctg 660 
aaagtgttca aaagtagata aaatagcaga agacacccac caaggataca aaacaatttt 720 
tagtagcgat cctgcattta atcagtatgt tcttaaataa actgctttaa aaaaattctt 78 0 
caaatgacac tgccaaaaaa attaggacac ccaaacagat gccagaaaac ctgtaagtgg 84 0 
gctggatttt atgtgacctg gtcattaagc atagggattc attttgtgag ctcatggcaa 900 
atcagtttga ggccagttga cctagaaccg atttgggatg caggcactta cctccctgca 960 
cgtgcaatat gtgggcatgg ggacacccat gacccccagg ggaaaggaat gcccctcctg 1020 
aacatgacct ccaagagtaa gggcgaactg tcagctttta actgtttatt ataaagacat 1080 
atttacacag aacaatcttt acaaacattg aacacagggg aagggaacaa tttcttaatg 1140 
aacagggcct taatatcttt gtataaatta gtataagaat cataaacaac cactttaaat 1200 
aaggcagccc ccctagccca cccactaccc tcttctgttc cctatctccc agctttctta 1260 
gccatccccc actttctccc cttccccacg gggctgggct tggctgcagg tcatggcagg 1320 
ccgatgaggc aggagacaca gaaaggaagg gggaaagaag gcccaatccc tgatgggggc 1380 
gtcagtggca gaagagactt tctgggcacc gaccagtccc cactccaagc atgagccttt 14 4 0 
aagcagcagc agcagcagca gcagcgttag cagcagcata ggtaaagggg cttgggggag 1500 
gtggataggc aaacattggg gctattgtgg gacttggggg gccctgactc ccccgtcccc 1560 
acacacacaa agttgggcat caggctcttg tcttctcttt ctccctcctg ggaaccctgc 1620 
tcaagcaaaa ggggagaaag ccccctccaa ggaatggctg gtgatggccc ctcacggaag 1680 
ctagggcctc ccggggagag ggtgctattc ctgctgcact tcctcccatc tttctttcct 1740 
tccttctgtt cctttgcttt cttttcttcc ctccttcctt ctgcccttcc cttccttctt 1800 
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ctcccctctc cgcctccccc aaaggaaaag ccctggaagg aaggtccggt caacacgaag 1860 
ggaaggccat ggagtccagt gattgaaggc tacctcggac tcctgaaaac caccctgggg 1920 
ttgagcggta gtctcagtgc ctgagccgcc cctattagag taccctgggt ctgggaatgc 1980 
tgccagttat gggggcagct ggccagttat ggaaccttcc agcccagctg ggggaatggt 204 0 
gcagcagggg taggtcaggg aggtgggagc agctccagcc ccacaacagg acagttcaca 2100 
gccagcttgc ctctcccttc ccttcccctc ccagccaccc ccagccccag cctcgggaaa 2160 
aggcacttca tttgctttga aaagacatca tcaagaggga agagggcgtc ataaagtagg 2220 
agatgggaga cctggtcccc atcccggctc tgcaatgact ctgagcaggt cactcccctc 2280 
tccaggcttc tgttttgcag gcgtttgcct aagaaggcta agtgaccttc aagggtcctg 2340 
tcagctctaa cattctgtga ttcttggcaa aaacaacctc ttgcttggct tttactcctg 24 00 
ggagggtata tagccatgtc aggcccagca ccctcccctc ctgacctgga gcttgggcta 2460 
gctggggatg gggtgtgggg gcaggaagaa gggagtattg ggaggcacac taaggcaaga 2520 
gaagtgacaa aggacttcac gggcccctct ctacctctcc cttaactgag caaacgctga 2580 
tgctccaccc acttcaccag agctcctgaa aaccaccctg gcacttccat gcccccaatg 2640 
tgccctgtct ggatccccgc agccagcacc tcttaactag agtctctcct tgcttttctc 2700 
tgcattcttc cctggaggcc aggtgagggg tccaactgac aggaaaacaa gggatctgct 2760 
ggagcccaca gaagggagca cttccacccc gcgctcaggg cagacatgag gaaggaaggc 2820 
ccaaatgaag gtttggggcg ttagatgaga caggcaggga ctagggcgga ggggacctgg 2880 
agaagaaggg aggctttctg gggctaggtc tccaaagtca gtccagggag gggccaggaa 2940 
gatggactgg acggattctt gtgagagaac gggaatcatc caaactacag ccaggagcac 3000 
gccactgggg gaagcaggca ggtgagaaga gctgggccca ctggtggcta cagcaggtgc 3060 
gaggtgagag ctgaatggac gtgaggcctc cagagaagca gaccaatcta tggaggagac 3120 
ataaccgccg ggggtgggca cttgggggcc ccttcagtcc taaggagaca aaaatgacaa 3180 
gagagaagtc atggacatat cctaggccaa aggaactccc caggaaagga gaggagagag 3240 
gagaccgcct tctctttccc ccaaactcca ctcagcccaa acctcaatcc caaggcccct 33 00 
gagttggtgc ctgcctgtcc ccactgcagt gggacagcca gcagaccagg gaaggggcag 33 60 
gcttcagctc tcctctctct gctgccacct tggttcttcc ctccttctca cagctccggg 3420 
caaccacagc cacaactgtc ctcttgttct ctgcctgaga gcccctagag ctctctgcct 3480 
ccttccccaa ctctggcgac aatctacact ggacatctgt aggtttgccc tgggcacccc 354 0 
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tccccagcca ttaaggtcca ccaggatgtc tagaaatctc taagcaggcc agcctcccca 3600 
accccaggca gcagggtgga agggagactg gccccaagta tcaagcccct ctctaggcct 3660 
cagacgagga gatcgtctgt aaaataaagg ggctagacag ctgccccata agctctcttg 3720 
ccagctctaa gcctctgtga gtgtggcgcc aggactgttt gatggatgtg tctggccagt 3780 
gatgagcggc aggatcaccc ggcttctagg ctgtccttct ccctccgcct tagcactgtc 3840 
cactgaacag aggctcaagt acctgcttca gaaaggcatg ggtcccttat gggagaggcg 3900 
gggccgctgg cggcggaatt tcctccgacc tccctgccag ggccctggcc cattccttga 3960 
ccctctgggc tgcaccaggt ggcggacatt gccgtcttcc agcccattcc catcgggaag 4020 
cggcatcagg gaccctgcag gaaggagaaa gcctgttagt gaggaaggtt gttggaaacg 4080 
ggcaggggct gcagaccaca ggcccgcggg aggggtggtc tgtccatgtg gtggcacttc 4X40 
tcacagattt tctttgtttc tttttttcat attgagggaa aatttgccat ttctgcactg 4200 
tcctgggaca aacggaacct actactcaca tcttgccatt gatgtagctg actttctgtc 4260 
cctttgacag ttatatcatg tttctgacag gcacaagtgg gttctttctt tggaacgtca 4320 
ctggggggtg tgcactttgg gttgtcacgc ctgtacttgg ctggggggca ggaggcagta 4380 
atcaccacgg cacccctgca actggtacac ttcaaacatt tcagacatat gacattacaa 4440 
tgaccctccc accgccatgg acccatcacc cagattcaac agttatccag atggagtcac 4500 
atatgcttta gttatccctt ttctattttc cttgactgca gtattttaaa gcaaatcccg 4560 
aacatcacgt cattgtactc ctacatcctt tgatatgcag gatgacccac ttgtgagtac 4620 
tagggagctc atatctcgcc cctgggtgtc tcatcggtgg ttaggcatct ctagggcgcc 4680 
aagtcctccc agtggatggg gctgggccag gcgggttaag ggggagaggt ctctgactca 4740 
gcctgcccca tgctgtgacc agatttagag acctgagacc agatttagag accaataaat 48 00 
attgcatgaa tatttgagaa tgaataaatg aaacaatggt caccttcagc agcatgtctg 4860 
tgcaccagga ggggggtggg ctgttgcagt tggtacctcc ccaccaatgc cctacctcct 4920 
gaaactccca gatcacaccc tgcatcctgt gtctcatgta aattccttcc atttcaatcc 4980 
cagctggctt ctattctgca ctctggggct cctctttaca tctgtccaca tctgtcagga 5040 
ggcttagagt agaggcctcc tgacaccaga ctgagccctg acaatggaat ccaagtggtg 5100 
cagggtatcc ttgggcgcag tgagggaaaa ggcatgcagg ccgactctcg tgaagcctgc 5160 
agtccatgca gggaagctga gctgccaccc ccaagacaag cctggctggt ggtggcaggc 5220 
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agtgtgtgga gatgagatgg tgctggggaa gaggcagcca ggcagggtac ttgtgcaatg 5280 
tccagtacag gcctgtgcag atacagcatc tgaggccaga gataatgaat aacagtctct 5340 
ctccccctgc acaccatctg atttgcctgc atctagatcc ctgtccctcg agccccattg 54 00 
ccatctaccc aggaattccc accacttgct cacagtttcc atagagcctc ccattggctg 5460 
ctcacacccc aatcataccc ctggcagctc ccccttcccc tctgcctctg cctccaacat 5520 
ccccacgctc tgttccttcc ttggctcctt ggtttttgtg agtcctcctc tgcttgcaca 5580 
caatttccaa gtgttcagcc tcctcttggg ccaaaagtgg gtgggtttgg ccttagggcc 5640 
ctcttccccc aggcttctga gcctgcagtt ctccccactc acacccagac atacccaccc 5700 
cctccacaca cacacacata aacacatatt ctccaagcct gcaccctcct tccctctttc 5760 
cccaagaaaa attgccaggc ccatgaggac cccgctgttg acttgggtca tttggttaac 5820 
tagccacact tccgtgccaa cttatccctt aaattcataa cctatgagaa ttaaaagaag 5880 
aattatctta gcactttgga gcagtaagta tggatggaac aagggggctg cttctgtctg 5940 
actgtcatag agctaccctc ttggcgccat ctgtgagggg gctggggctg gagagggaaa 6000 
ggaaggaaag accttcacta agctccccac tgcagctgct tcagcctctg ccttgagttc 6060 
tggggtcccc acaaggctca ccctgatcac tgagggcgga tctgaatgat gctgataagg 6120 
agcaggggaa ggacctaggg gccttgtgaa gtgaaccaag tcaacacttg gctctcacct 6180 
ctgcaccttc gactgctggg ggtgcagaag ggatggggca gtggtgcagg ccattgtagc 6240 
tcagggtggt ggcagacatg gagaagagaa gcatggtgcc aggaggggga gggcatcatc 63 00 
tccaggtgct cagattgtga gtgtctggca gtcagaaagg gagggctaga ggccatatgt 6360 
caggatgggg agcagggttg gtggctctgg ccaggaagat gtctagactc taggtctgca 64 20 
gagaagaacc actctgagct gactgaaccc ccagccgggc cctggagagc age cccaggc 64 80 
ttcttgeate cctcccattg cgttggggac tcattgtcct cagcatcacc gagatctget 6540 
ggctcctccc gcactctgag cccgtcaccc acctgtttcc tacttttgat tctgagtctg 6600 
gctcaccatc agataagctg acatctgaaa ccctggcttt atgctgtcca aaggaacaaa 6660 
ttagaggctg ttgattctca teactgeaga agtcttgact gtaaaaaaga attccccttt 6720 
agaaatggag gccagcccta atgcacttaa atattttctt ccagctcatt tgctacgtaa 6780 
aaggaggctg gagcaatcct ccaaacaatc tegagectag agagattgga gccattgcct 6840 
gctgaagtgc catcagccac gaagctacaa catctccacg ctgagaacca cagtcagggg 6900 
gaggacagga tttgactttc atgctatctg aactccaacc ctgccaatgc acaaagaggt 6960 
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acttgcgact tttcaggaac atgccagtct catcaggaat gacgcacctg tttttcctct 7020 
ctcagtctgg cctgtcttgt gatgacttta aaagcccaaa cgtatgctcc ggtggggacc 7080 
ctgctggcaa tgcccctcca ctgctctgtg gtcctgcatg tgtctctttg tctgcccagc 7140 
cccccatccc agtcgccttc tccagggccc tgtgctgtct ccagtgccag ctttgctgcc 7200 
ctgggataca tcatgctctg tctggtggcc ctctgcttcc cttgccttag ttctttgcga 726 0 
ggagaggcct ggctcaggtt acacgggtgc ttagtgctga ggactgggga tggagtagag 7320 
atgcccttcc ctgagcagga ggggtacagt aggctctcct cacactccca ctgtgactgc 73 8 0 
aggctttgag ctgggcctgc ggagggggag gagaagggga aaaagagggg tgctcagaga 7440 
ccgagggaca cttggccctg gaccaggagt tctcagactc acaagagcaa agtgtatccc 7500 
aggtcaggag gcagtggagg agtgtacgtg gtggtagcgg ggt gat gaga ggcagcgtgg 7560 
agaggggtcg ggggtggaca tttcaagtca ggacagactg tggagaggag gaagtgtggt 7620 
tttcaggctg gaactttgac tggggactct ggctggggac agcatgccag tccgggccaa 7680 
ggacaaagtg gttctgggat tcctgcttct aacccaggac aggagccagc gggagcagga 7740 
catgtgtacc tggacccttg gctggtggcc acttccaggg acatgccctt ccttggct cc 7800 
aggagaagtc agggagtaga gaagtcagtg agagccctga aagtttccag cacgaggctt 7860 
gagcaaaatg tgagggcaac gaatttatcc aaatcacagt ggacagatga cctaatgaag 7920 
acgggggaat taaagcacga agtggcaggc tctggaccag ccacatggtg ttcgggcatc 7980 
ctggctgact tcccttccct tgaaaccagc tctctcccct ttgctagtac tgtcccaatg 8040 
gctgggaccc aggagggtct gagtggttgt ctattggcca ggggtctccc cacagcctct 8100 
ggggggagca ccaagtgtgc agatccagga gccctacctt cctctggaac actctctgca 8160 
cccagtcttc ttcttccctg aaggcaccac cctggactat ttgcctgtgg tggtattgtg 8220 
caagtgtcat gccttgccaa tgacaccaga tggtaggctc cctgaggata ggctggtatg 8280 
gtgtatcatt cgtatcttac attcgtatct tttgctccct ctgttacctg cctgacagga 8340 
gtttctgtgg aaaaagccta cccacttctt actgtggtgg tggctcactt cgtgttcagg 84 00 
tcttgaatag agaaatcacc ggccagctac ctggaggcag gaggtgccag ccccaaccac 8460 
tgcacctttc tgagaagcca ggcagtgttc ccagagccac agcagggcca aaaagcaaga 8520 
gcagagaaag gaggtggcct gcgatgagag gcaggcagag ctggctgggc ccctcggagg 8580 
ctcctgggct gcatgccatc ctcctgttct: ggagggtttg gaaccactta gggccctgtg 8640 
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cccttgcccc aggaaactca ctgccctgcc ttctccttct ttctgctccc acctccctgt 8700 
gactccagcc atggtcctgg cgttagtcca cctggtcttg gccttcccct tgtgtggtgc 8760 
caggcaggca gcaatgacag ccagatcata ggactgtggc agctggaggt gggagctggc 8820 
agccccagga gacattgaca cagaggacag gcagcctggg atggggctgc tggggcgtgg 8880 
ttggggacca ggctaggggc ggacatgggc actagtgcca agtattggca ggtgagggca 8940 
aaaggactcc cctttcctga gctgcaggga ggggtcgggt caggtgctgt gcttcctcct 9000 
ttggtgccta gcggcaggga gactaaagtg aagcatgtcc gtgcctggga cagaaaggaa 9060 
ggctggagcc aggatgtaag agaaccaagt ctctgggggt gggatggagg ctatggggag 9120 
ggcatcctgt gcaggggagg agaccagcca ggaccttggg gttagggagg agaagaccag 9180 
cccagcccgg ctgggcccgg ccctgcctgg gggaggctgc ctctgctcac acatgcaggc 9240 
cgaaaggagc aacagctggg ctccatgccg ccaccccctc cgcgcactcc tgcctatgca 9300 
acaagtgtca cgtctgcatg ttggcacatc atccccggtt tcccgcgccc ctggactggc 93 60 
gggaggctcc cagccttcag ggaccagaag acgttcaaca tgggagccca gcccactcga 9420 
ctctggtcag ttccttccat cgatccacga gggagcgggc atgtcccccg cctccacctc 94 80 
taccacgcgg ggtgcagggc gtgggacacg cggcgcacac ctgtggtcct gagctcctgg 954 0 
gactgcgagc gacggttagg agggacaagg tgacgggcag gtgatgccaa aggccgagtt 9600 
gagccccgca aaagaagcag tccttgtggc cagccgcccc atggctcggg gcgctctgtc 9660 
aatctgctgc ctggcgctgg ccggcgcgtg gctgctactg cacggctcgc gccgggctcc 9720 
ccgggaggcg gggagagtgc gaatagggcg gagggaaagg agcacgccgg ctgcagcccg 9780 
ggcgagcggg agggcgcgca ctcacctcca cacaccgcgg tcaaggagag ccagagcagc 9840 
aggagcgcct gcacgcagag ccgcagattc atgctgctcc ttgggccgcc gcggccccgg 9900 
cgagccggcg cgggggagga gaggtcgggc gcccggaggc caagaaaggc gcgagccgcg 9960 
gctggcgcgt gcgggcgcag agctcgggag gctccccggc cgctgagtgt gcgcgctgag 10020 
ccccgccgct cccgctggcc gcctccgctc ttctgcagcc tcctctcccg ccgcggggca 10080 
gcgccgcgaa gctggcctcg gcggctccgg gagcggcagc ggcgagctct ttcttagcgg 10140 
ctggctgctc ggccgcggct gcaactgccc gtgaccccgg ctgccagaga gaatgctccc 10200 
cgctcactcc aggggctgca ttttgtagct tgtggcttgg ccgcgagccc acttggtcat 10260 
gtggtcatcg ggagggctag agggggggca ggaagaggga gagggagcga gcctcccgca 10320 
ccgcccccct ccaggcacgc actctgcagc cccagcccga gcgtgagcgc gagagggaac 10380 
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cccgaggtgg ccccacaaca aaggctgcgc ggcctctctg acaacctaca accgctcccc 10440 
ggacaatgcc cgcctggccg ggggccagag ggtttgcaga gacctgaagg atttcaaaca 10500 
caggaagcat ttggggcggt ggaggggcac acttaaccct ttctcacctt gcttccttaa 10560 
cagctgtcag gaaacctggg cgcttaggga aggactcagt ctgggttgcc ttcactcacg 10620 
ccctctacct ctgaccctgg tccttgccct tgccgctggc cagcttcaag ggtttggtca 10680 
agacaaccaa acactggccc tcctctctcc tctgcatttt caactggctt tgaattgggg 1074 0 
gtgactgcct aggcttgtgc ccaaggcttg attaattgcc caggcccctc atgttccctg 10800 
catcccccag gaggcaccct ctacctggaa tgcccccacc ccacccttcc gtgagcaatt 10860 
cattctttgc agcagcacct caaatcccat cgcatccaga agccttctca ggataacact 10920 
aatccagcag gggccgccca tcagctcctc cccactgttg ggacatggga cagctcctat 10980 
ggtctaccct ttcttgttcc ctggagctgt cctcattcta gttccccatc aacgtaagag 11040 
gcaggtggtt tacccaactc cacacagtca aggctggatt cattttgcgt gtacttgcga 11100 
ttccgcttta gaagcgggtc ctgaagttgt ttgcctttcc agactcgatt gttagctgct 11160 
tctatagctt gggagctccc aaggcatggg gctgcttccc tcattggact gggaaccccc 11220 
taaggtagga tactgtcttc tcactcttct ggatctctcc aaagtcccca atatcttgct 11280 
ctgcacatag gtggcactca acacatatta aaagactgaa tcgaaaggtg tctccacctt 11340 
tcagcaagcc aggcagtttg cttgaagtgt taagtccaga tttacacaaa agatacattg 11400 
gacagggtgc gtgtgtgcca taaaagacat gagagaccat ggcatattta aaatatcttt 11460 
gaaatgttgc aaaaatccta tatatacaca atgtatttat tatcagtctc tctccaccag 11520 
aagggttacc tccacaaggg agggatcttt gctttgttca cagctgtgtc cccagaccta 11580 
gaacagtacc tgcgacatga ctggtactca atacataatt ttgaatatat aaatgaaatt 11640 
ttcaaaaaca tctttactgc ctaaattcag gtagcaggaa gggaataaga taaggttctc 11700 
attaatagcc tatggcccat taaccacact caagtaaaaa aaaaaagcat gtgaagcatg 11760 
gcacccctct tcatccttga gacccaggat atgagtagca gacatgggta gcaggagacc 11820 
attctccctt cactgttccc ctggaaagag tttgctttct ttctcttttc tttttctttt 11880 
tttctttttt ctttctttct tctttttttt tttttttttt ttgagaccga gttttgctct 11940 
gtcgctcaag ctggagtgca ataccctatc tcagctcact gcaacctccg cctcccaggt 12000 
tcaagcgatt ctcctgcctc agcctcctga gtagctggga ttacaggcat gcgccaccat 12060 
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gcccgactaa ttttgtattt ttagtagaga cggggtttca ccaggttggt caggctggtc 12120 
tcgaactcct gacctcagat gatccacccg cctcggcctc ccaaagtgtt gggattacag 12180 
gcgttagcca ccacgcctgg atggaaagca ttttctaacc aggaggagaa acacctggct 1224 0 
tctagtctcc accctcattt gctgtagggc cttgccatca tcacctaaac ttactggggt 12300 
tcatggaggt gataatctct gctctaccca cttcaaagtt gttatgggat taagtgagat 12360 
aatgtgcctg aaggtaccgt gcaaattaag aataagctct tgcagggttg tttttaggaa 12420 
acttggtgct ccaatccttg attgggcaag cccatgcaca cccaaataag tgtcccccat 1248 0 
aactaggtta atagaagcct tcatatttaa ttgcaccttt actagggtgt gtatggcata 1254 0 
ttgctaagtg gttggaacac atagcgcttt ctaatacaat agccactaat catatttcca 12600 
atgaagttta aaattcagtt cctcagttcc actggccacg cttcaattgc ctatatgtgg 12660 
ctagtggcta ctgtattgga cagcacagaa tagaatcttc aatcactgca gaaagttcta 12720 
ttggttagag cattcattca ttcttctttt ttctataatc cattttccca ccctacctag 12780 
ccttcaaaat gtacaaggtg aggggaaaaa aagcataaaa gcaaaccaat agaaattcaa 12840 
gccaagctga aggcatgcgt ttagagtaag atttaaagat ctgacacttg ggagaatcat 12900 
tttttaattg aatccttctc ttcatttaac caaattacat ccattttttt ttagaggcag 12960 
ggtcttgctc tgttacccta ggctgaagtg cagtagtgca atcacagccc aatgcaggct 13 020 
caaacttcta ggctcaagtg atcctcccac ctcagcctcc caagtagctg ggactacaga 13 080 
catgcaccac cataccaggc taatttttaa atttatttat attttttgta gagactgagt 13140 
cttactatgt ttctcaggct ggtcttaaac tcctggtctc aagcaatcct cccatctcgg 13200 
cctcccaaag tgctgggatt acaagtgtga gtcaccatgc ctggccacat ctgttttcgt 13260 
agtcccagat ccaagatttc atctttcttt ctcaggacac ccttgcccac agtgactttg 13320 
ccctctccat aacagctatg gctgtacctg tcattggtag gtcctctcac agtgcatcag 13380 
gaacactttt gcatctgcac actgtgctct acgggcctgc tgtaattctg cagagaggac 1344 0 
ccatcaccac catatccatt caggggtcaa ggcccatgag gcaaggaacg cagctctgat 13500 
tgcccactct gaaccccttg taggctccca gatatcttaa tatcagattg aaatgaccag 13560 
ttttcgaata tctgtatcta tttgccagac tttaaggtca ctgctgtgat agcacctagc 13620 
acaatgcctg gcacataatt gatacttcat aactgcctgt caggtgagca actgtcggcc 13680 
caacactgtg ctgtcttttg ttcaggatac tattgaatga atatcagcag gataaatgtt 13740 
ttaaggcata agtctccaat ccctgggcca tggaccacta tggctccctg ttctattagg 13800 
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aaccaggcct cacagcagga ggtgagcagc agatgggcca gcgaagcatc atctgtattt 13860 
acagccgctc tccattgctc ccattaccac ctgagctcca ccttctgtca gatcagcagt 13920 
ggcgttacat tgtcatagga gcacgaaccc tattgtaaac tgcatgtgtg agggatctag 1398 0 
gttgcatgtt ctttatgaga atctaatgcc tgatgatctg tcactgtctc tcatcacccc 14040 
cagatgggac catctagttg taggaaaaca aactcagggc tcccactgat tctacattgt 14100 
ggtgagttgt agcattattt cattatatat tacaaaaaaa aaatagaaat aaagtgcaca 14160 
ataaatataa tgcacttgag tcatcctgaa accatccctc ctaaccctag tccatggaaa 14220 
agttgtcttt catgaaacca atccctgatg ccaaaaatgt ggggaccact gtttcaaggt 14280 
atttctctgg agatgggaag gaggtacatg aaactgtcca cactctcctg ccctgaggca 14340 
gcaagtagct gtaaaatttc tggaaatgtt gctgcatttg gggctcagga cctgtggtca 144 00 
gcagcaagac tcagtttcca gtcaaggtga gggacaatcc tgggggcaag ggtaggactc 14460 
agtgcattgt tggggacaga agtgagatag ctgacctatg tgagttcaaa gtgtgacttg 14 520 
cccccctcat cagagctatc cacatgctat tcagttctgc ttattgtcct atgtcaccac 14580 
cagccactgc tgttcagttg actcatccac gtctcttgtg ggttaaagtg aacagtgtga 14640 
ggcattaaaa agaactcagc ttttggagtt tcacataact gaattcaaat cctgcccctc 14 700 
aaccccattc gttagctgtg tgaccttggg caagttattc agcttctctg aggctcagtt 14760 
ttttcctctg ttaaatgagg ataataatgg tacctaattt tttaagtgct aaggtttgaa 14820 
tgtatgtgtc cctccaaaat tcatatgttt aatttttttg tttttgtttt tggagatgga 14880 
gtttcactct tgttgcccag gctatagtgc aatggtacga tcttggctca ccgtaacctc 14 940 
tgcctcccgg gttcaagtga ttcccctgcc tcagcctcca gagtagctgg gattacacgc 15 000 
atgtgccacc acatccggct aattttttat atttttagta gagatggggt ttctccatgt 15060 
tggtcaggct ggtctcaaac tcctgacctc aggtgatccc cccgccttgg cctcccaaag 15120 
tgctgtgatt acaggcgtga gccaccacgc ctggcctcat atgttgaaat taaaagccca 15180 
a<?9tgatgtg ttaagaggaa gggccttttg agattgatta agtcatgagg gttccaccct 1524 0 
catgaattaa ttagtgctct tataaaagaa gttggaaggg gcaccctagt tccctttttg 15300 
cccttctgtc ccttatgtca tatgaggaca cagtgtttga acgctccaga ggaagcagca 15360 
ttcaagacac catctagtaa gcacagactg ggccctcatc agacacagaa actgccagca 15420 
ccttgatctc agacttccca gcctccagaa ctgtaagaaa taaatttgga ttatttataa 15480 
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ttatccagtc tcgagtattc tgttatagca acaggaatgg acaaagacat tatggttggg 1554 0 
agattaaatg agataatgtg tgtaaaatac ttagcacagt cccctgctca ttggatgtta 15600 
gttttctcct cttcctgacc tgcttctccc cccacctttg ggtctatcct ctggccttct 15660 
gaaactgatc aggaagaaag ggctctgata tggtttggat ctgtgtcccc accaaatctc 15720 
ctgtagaatt ataatcccaa gtgttggagg tggtgtctgg tgggaggtga ctggatcatg 1578 0 
ggggcagagt tctcatgaga ctgggtaatt tataaagaaa agagttttaa ttgactcacg 15840 
gttccacatg gctggggagg cctcaggaga cttgcaatta tggtggaagc caaaggggaa 15900 
gccaggcaca tcttacaagg cagcaggaga aagagagagg gagggggaac tgccaaacac 15960 
ttttaaacca tcagatctta tgagaactca ctcactatca tgagaacagc atgggggaaa 16020 
cgaccccatg atcaaatcac ctcccaccag gtccctccct caacatgtgg ggattacaat 16 080 
tcaagatgag atttgggtgg ggacacagaa ccaaactata tcactgccta acctcaaagc 1614 0 
ccacactctt tcctcactga acatcaagaa gcagtttctc gggaggctga ggccggggaa 16200 
tggcgtgaac ccaggaggcg gagcttgcag tgagccgaga tcgcgccact gcactccaga 16260 
ctgggtgaca gagtgagact ctgtctccaa aaaaaaaaaa gaagcagttt ctcacttttt 16320 
gggtccacct atcactttat ttgaacttct agcacctaac tcatactgtc ttgaattagg 16380 
aagccatgaa gtacatagtg gttaagcaca tgggttctag aataaggcag ctctgtctct 16440 
tactacatag gtaactttga gcaaattatt aacctttctg agccttaggt gcctcatcag 16500 
aaaaaagggc tagcacctag ctcccattgt tactgaaagg attaaaagaa tcaataccta 16560 
taaagtactc ggcactgagt ttggttctta tagtaaggga agcattgtgt aaatgttaca 16620 
tattcccatg ggtatttgta tatatctctg acttatatga gatagtaaca ctatgtctta 16680 
ttcgttttta tattctcacc agcagtgctg ttgcccaaag caaatactca ataaatgttt 1674 0 
gttgaatgaa agaaggaatt gtatattctt gttaataatt gctctaggtt tatgatccat 16800 
atcaagtgtc tctatttggg caaaatcatc acaacttgtc caactggttc ttagatgggt 16860 
caacactaac agctaacaga catacacaaa gatgtgacca tcttccctct cctcattgtg 16 920 
tagtagttgt ttttactgga tcaatgcaaa aagtggagtg gggagaggac atctgaagaa 16 98 0 
ggatgaccct ggccagcaat cagagagatt tcataacagc aagagcccca accccctcat 17040 
tatgcagatg gagaaactga ggtccagaaa ggaaaagtga cttacccaaa gctacacaga 17100 
aagctggtgg caaagtcagg acggagccta ggttcaacta aatcccaact cctatttttc 17160 
cttctgtgtc ttcccagcat agaagtagcg ctcactggcc aagcagtccc ctagctggct 1722 0 
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gcataggcac aacaaaatga ctgtggttag gcacaggatt ctcattccat aggccagcgc 1728 0 
ttttggagca tctcagaatg agtcttccca ttgtcactca tctaatttgt tccttaagac 17340 
ataagctaaa ttaatgagca aaagaagttc agtagcagca acatcgggaa aaagtattta 17400 
agaagaactt caaagcttac cctgcctctg aagctactta acaggaactt gggggaaaaa 17460 
agcatgggcc agctgctagt tgatgcctca ctaactagct tcatgacaaa ttatctttgc 17520 
tcagttaacc accacaagag aacatgaatt tttcactctc tcattactga gtatttgtct 17580 
ttctgatact aacggctaga gttgagcttt ctaactcatc tataaccacc agagcagcca 17640 
gcacaatgct tgccatagag tagtagatgc ttaagccata ttcatgggtt cactttccca 17700 
ttctgggtgt gagaccattt aaatcagaga catgatgttc attatgtgaa acttagcctg 17760 
gaacttattc tgtgaatcaa aagatcctaa taatcatttt gagggaataa tctctttgtt 1782 0 
ggactttgct ggggtttctt cttcagctaa ctgtgtgatt cttgtcttga tgtccaatac 17880 
ctcttaattc tgatcaccag ctaggtccta agcctggcca cagaccgaga tcttgtcttt 1794 0 
cacacttgcc ctcttcccaa gcacacagag tcaaatacta accatcacac ggtatcaggt 18000 
ttgcaaaaat tagagaagta gccattttgc agcataacca cattatctct attcacatct 18060 
caagtgttta atactatatt caagtcacta catgctaatt ccttccatgg gacacctttc 18120 
agccactgcc acatgggctc caactgccct ggaaatgctt attccatttc tctgtttctt 18180 
ccaacttctt tgcaaactac ataccttctc agatcgtctg atatattcaa ggaacaccag 18240 
gggttcccac agatacgcct tccttgacac aacacccaag ttcctgaaaa atgttgtcag 18300 
tggaattcag ataatcagct atatttcaaa tgtctttgaa gggttcacag tttattagct 18360 
gaataagtcc taagagagat tctctgaggt gaagaactgt tcctctgttc ctgcccccag 18420 
cccagcccag ctccacagaa ttaccccctt ttccctacat cctgcttcat tccttctgat 18480 
cctctcccat agccacacag gtccccattc ctcctgtttg tctttcagag cagtttcatt 1854 0 
tcctctgcat atcctctatg acgtctccca ggagcctcca tccttcctct cccctacggt 18600 
ctataaata# cctggtctcc caggggaggc atgtgaattc cttccttacg cagcaaggcc 18660 
ctgtgagtta ggatagccac tcccaaagga gcaggtttat ctccaaagtc cattgaacct 18720 
ttcttcctcc tgctttgctg gttaatatta tctgggctta tatcatgtaa gatattatat 18780 
gtctcaaatc tggcaccttt gtttgcagag ccagggcaaa gactgttgag gacacggtat 18840 
tgagtctatg accagttccg ttatgggctt agagttctgg gataataagg tcaaagtcag 18 900 
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agaga4aatg aggtcagctt tgttctatcc caaactcaag tttggattga caccagtcac 13960 
tacctgaatc ctaaccatgg ccacagactg agatcttttg aggctctcac tttttgactt 19020 
tcatacattt cccatacaca taaagagaga gtcaaatatt gtactaaaat ttaaaaggta 19080 
acacttctgg agaaaaaaga aagaaaacta gccatcttac agcaaaactc catgtgttaa 19140 
ttcctcaggg ctgctatgac agattaccac aaacttcgtg gcttaaaaca acagtttatt 19200 
ccttctcact tctggaagcc agaaatctga aatcaagaaa gctctagggt gagaagttca 19260 
tgataaatta gcgaagtgat agatgtgtaa attagtttga ttgaatcttt ctacaatgta 19320 
tgcatagatc aaaacctcac attgcaccct ataaacatgc acaattatca tttgtcaact 19380 
aaaaataaat aaagataaaa tgaaagtgta agggaggact cttccttgcc tctttcatcc 19440 
cccggtggct ccaggcattc tttgacttag ggctgcatcc ctccaacctc tgcctccatc 19500 
ttcaccttca catgacctga cccttttgtc tatgtgatat ccttctgcct ctctagggtc 19560 
ccaccccgtc gcccaggctg gagtgcagtg gtgtgatcat agctcaccat aacctagaac 19620 
tcctggcccc aagcaatccc cctggctcag cctcttcagt agctgtagct acaagcacca 19680 
ccacaaccag ctgatttttt attttttatt tcttttgtaa agacgggtct tgccgtgtta 19740 
cccaggctgt tctcaaactc ctggcctcaa gtgatcctcc cactttgacc tcccaaagtg 19800 
cttggattat aggcgtggct gcctcactct tataaagatt gtcagcaatg gatttagggc 19860 
ccacttggat aatccagtat gtctctcttg agatccttaa acttcttaca cctgcaaaga 19920 
ccacttttaa atgaggtgtc aaatttacag gttccagggg ttaggacata gacatatctt 19980 
ttcgggggct actgttcaac ccattacacc ccactacctc catttacatt tctggcatgc 20040 
tggaatcaac acagcatgaa ctttctgcca atttaacctc caaatatctg tcaaatctac 20100 
acgtctctat gtttctgctg ccctattctt gttccagccc tgtcattcct cacacatcat 20160 
tctccatagc aaccagaatg gttatgccag ttggcttgaa aaccttctgt ggctcatggc 20220 
tcttaggaca agaccaaact cttaaccatg gcatgcaagg ccctgcataa tctagcccct 20280 
acccacttcg ccagcctctt ctcttgcctc ttatcctctt tctatctgca atgccatgtc 20340 
cctcctgtca cagtgccttc ccaatacttt ttcctcctac tgaatgattc tctgctctac 20400 
cctgccttct cctggctaat gcctatcatc aatcagctca tctctaatgt caaatcactc 20460 
ctcctcaaga gatgttctcc ttggccctca gactggctga aattcccctg taaccgcttc 20520 
tcatagttca caattcagtt tgtgttcagt taatgtctgg cccaccccat ccacccctca 20580 
ctgtccaact agactgaact: ccataagggc aggggccatg gctgtctggt ttcatgactg 20640 
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tgtcctcagc atctagccag tgtctaactc atagtaggtg ttcaatcatt ttcccaatca 20700 
atgcatatat gaatgaatga atgaataggt ctgaacatct gtgctttgct ccctcaagtt 20760 
cttctaaaaa atgagaaaac aatgtttcac tcccagggtt gaataagaat caaaacaaga 20820 
gaatgaaagg gaaggagctt tataaaccat gaaacgcctt actaatttga ggaatcatta 20880 
caatactggc caatgtaacc attcttctgg gggagtggga agtgcactgt gcacgactta 20940 
ggggtagcag gaggaacaag ggtggaggtg gggaaggata cttgagttct acattcctct 21000 
ggataagaag aatttctgtg gctctgctat gtaggataaa attacaagaa cattcaattc 21060 
ccctgcttca ggcttgaagc ttataactga tcaccctctg ggatcaagaa gaaaatgcca 21120 
ttctgctcag gtgtaaatat tgcccggagt cacctcgggt atttaggatt ttgactgtct 21180 
tcctttaaag catctgaccc aggtgcaggc tgaggatagg gtgacagcct cctgggctgg 21240 
atccctcttg gctggtgctt cccatgaggt gatccatcag ccgtggcact tgcttaaggc 21300 
actgcccagc tgacaaacgg ctttgaagtg tgccttttgg ctttgcattc ttccacttcc 21360 
tggccttctt ggctttcact gttagcaggc aggcggaggg gttggcgggg ctgggtgggg 21420 
gggcggggga ggtggtatca cctccttcct cttgctcctc tagcacgtcc atctaaaaaa 21480 
catcccagga ggcttgacat tccaggtctt tagtttctta catttgattt ccctagaccc 21540 
catcaagtca taatctcctt cctgcaggga agggatactt ccacctcttc ctccatatcc 216 00 
tcttagctcc ttgcatttgg gtcaagagac ctaaaaaggc catttcttat ttccatcttg 21660 
tcttccattg ttttatcccc tgggatatct ctgcattcat ggccctgaga tgcctcactg 21720 
gtctggaatg ctcagggcaa gccgatcctc tgaatcatag acaaagaaac tggaaagagg 21780 
agtgctcatt aggctgtcat ttccccagtt tcctccactg acctggatgt ctttccactg 21840 
gtttggtggt gtgcttcccc actctgtccc actggatttt gaggttcagt ctacacagcc 21900 
ccagagaggt gaataacaaa agcatgtcta cgcataat cc aagcaaaatc ctcagttttg 21960 
tctcaatgat ttgggcatcc cagtacaaag gactacttca gtagcaggct ttgggataat 22020 
tttctctaat agttaccaca gaattgaagc tggaaaggac cttgaagttt atctgttcaa 22080 
agacaccctc cttttttgtt tatagcgcta tagaggtata attgacatac aatacctgca 2214 0 
caagtttcat gcagtgtaca aaagtgtaca atttgctaag cttttacata tacatactct 22200 
catgaaacca tcaccactat caagttttca tcaatcccta cagtttcctt ctgcctcttt 2226 0 
gtaaactctc catccagccc ctcccctctg ctccatctcc aagcaatcac ttggggataa 22320 
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tgctttcaag cagtcacctg gggatggagg agaggggagg ggctggaggg agggtttact 22 380 
tgttaatata tagtttgcat tttctagaat ttttaataaa cagaataaca caatacgtac 22440 
tctgtrttttg ctggcttctt tcacacagaa aaattatttt gggatttatt cctgctgttg 22500 
tatgtattaa cagtgttttg tgtatcagca ggtcattact tttgaatggc tgagtggttt 22560 
ccaccatatt ttgttcattc attcatgtgt tgatgcacat ggatggtttt ttttttctag 22620 
atgttggcta ttgccaataa aactgctatg aatattcata tacaagtctc tgtatggaca 22680 
aagactttca tttcccttgg gaaaatacct aggagtggaa tgctaagcca tatggtagat 22740 
gtctgtttaa cttttacaac actaccaaac catttttcaa aatggtcata ccatttaaca 22800 
ttctcaccag tagtgtatga gaattccagt tcctccatat cctcaccaac acttggtgtg 22860 
gtcagtcttt tgtttgtttt ctttattgtg gtgaaaggta tatataacaa aatttaccat 22 920 
ttagaccttt ttaagaagac ttcatttaca tattttttaa ttgtaggtaa ttacacataa 22980 
caaattttaa ctgttgttaa gtttatattt tgtcagcatt aagtatattc acattgctcc 23040 
gcaaccatta ccaccatcca tctccacacc tttttcacct ttccaaactg aaacactgta 23100 
cccattaaac attaactttc catttgcgcc ttccccagcc cctggcaacc accattctac 23160 
tttctgtctc tatgactgta actactctac tacctcacat aattggaata gtacttcagt 23220 
cagtcttttt aactttagcc attcgaatag atctgtagtg gtatctcact gtggttttaa 23280 
tttttatctc cttaatggct aatgaggaaa gcatctttat atgtgtttat ttgccatcca 23 340 
gatatcttat ttggtgaatt gtctgttcaa atatttagtc cattttttat tggttggttt 23400 
tcttagtatt gaattatttt aacaacttta ttgagatata attcacatac catacaattc 23460 
accagtttaa agtgtacaat tcaatggttt ttagtatagt tacagagttg tgcaaccatc 23520 
accataatct aatattagaa catttttatt atcccagaag gaaactccat atccattacc 23580 
agtcacttcc cattccccta cccttccatc ccctggcaac caataacctt ctttctgtcc 23640 
ctatagattt gccttttctg aacattcgta tgaatggagt catacaatgt tggccttttg 23700 
tgtctggctt ctttcactag cataatgctt tcaaggttca tccatgttgt ggcatgtatg 23760 
agaaatttat ccccttttac tgccaaatca tatccaattg tttggatatg ccacatttta 23820 
tctatccatt tatcagttga tggatatttg ggttgttttt atttttgact attatgaata 23880 
acgctctatg aacatttgtg tacatatttt tgtgtggatg tatgtcttca attctctcag 23 940 
gaatatacct acaaatatgt acttgggcca tatgacaact ccatgtttaa ccatttgaag 24 000 
aactgctaaa ctgttttcca tagcaacagt accactttcc attcccacca gcaatgtatg 24 060 
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agggttccaa tttctctata tcctacccaa acacttattt ctgtttttgt ttttattata 24120 
gccattctag tgtgtgtgaa gaggtatttc attgtggttt tgattttcat ttcccttatg 24180 
actaatgatg ttttatgtgc ttgttggtct tttgtgtatc tttggagaaa tgagcctttg 24240 
cccacttttt gattgggttg tctttttatt gttgagttgt aagagttcat tacatgtttt 24 300 
ggatactagg ctcttatcag atatatgatt tgtaattttt tctcttattc catgggttgt 24360 
cttttttctt tttcactttc ttaatagtgt tctttgatgc acaaaagttc ttcattttga 24420 
tgaagtgaaa tttatctgta ttttccttgg tgcttttggt gttatatcta agaaacaact 24480 
aatctaagag cacacagatt tatacctatg ttctaagact tttatagttt gacctcacgt 24540 
ttaggtctat gctctatttt gagtacattt ttatgtgtga tgtaggaaag gagatcaact 24600 
tcattgtttt gcatgtggat atccacttgt cccagcacca tttgttgaaa agactattct 24660 
ttctctcatt gaattgtaaa cagtattgtt tctaaatttc agtttccaat tgtttgttgc 24720 
tagcatatag aaataaagtt gacttatata tattaatttt gtacccagaa accttgctaa 24780 
tgtcacgtat tagttctagt agatttcttg tagatccttc tagattttct atatacatga 24840 
tcatgttatc tacaaataaa gacagtttta ctgttttact tcttcctttt ttatctagat 24900 
gcctcttatt tcattttctt gcctcattgc actggctaga acttccagta aaatatcgaa 24960 
taagggcaga cattttttgt ctatctcctg atcttagggg gaaagaattc agtagtcaac 25020 
cgttacatgt taactgtggg tttttcataa tacacttcct caagttgaga ttgtttcctt 25080 
ctattcctag tttgttgaga gtttttctta aaaaggatgt tggattttgt caagtgcttt 2514 0 
ttctgcatct actgagataa tcctatggtt tttttctctc cttttgttta ttaatgtggc 25200 
aaattacatt gcttgatttt caaatgttaa gccgactttg ccttgctggg ataaacccca 25260 
cttgatcgtg ggtgtatcct cctatttcta tattgttaga tttgatttgc tacaatttta 25320 
gcatagatgt ttataaggga tactgctctg tagttttctc ataacatctt tttccttttt 2538 0 
tattatcagg gtaatgctag taataaacat agctaaaatg aaaagggaaa tatttcttcc 2544 0 
ttttcaattt tctggaacag tttgtataaa attggtatta tttctgctta aaatattttg 25500 
tagaatttac caataaagtc atctgaccct gaagttatat ttgtagaaag gtttttagcc 25560 
acaaagtcat tttttaaata gaaggctatt catagtgcct gttcttcctg agtgagttta 25620 
ggtgttttgt gtctttcaaa aaattttcca tttcatctaa gttgtcaaat ttactggcaa 25680 
aaagttgttc ataatattcc cttagtattc ttttaatatt tggagaatct gtagttatgt 2574 0 
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cacccctgtc attcttcaaa ccggtaattt gtgtcttctt tcttcttttc ctgattagcc 25800 
tggctaaaaa tatatcaatt ttattgatta tctctaagaa cttcagatgt tctctaagaa 25860 
ctttcaactc taagcttttg gcttaattcg ttttctctat tgtttgtttc ttttctattt 25920 
ctttaatttc tgctttgatt tttattattt tctttcttct actttgggtt taatttcttt 25980 
atctctttct agtttctttt ttttattatt atactttaag ttctagggta catgtgtaca 26040 
caacgtgcag atttgttaca tatgtataca tgtgccatgt tggtgtgctg cacccattaa 26100 
ctcgtcattt acattaggta atctcctaat gctatccctc ccccctcccc caactccacg 26160 
acaggcccca gtgtgtgatg ttccccaccc tgtgtccaag tgttctcatt ggtcaattcc 26220 
cacctatgag tgagaacatg tggtgtttgg ttttccgtcc ttgcgatagt ttgctcataa 26280 
agatggtttc cagcttcatc catgtcccta taaaggacat gaactcatca ttttttatgg 2634 0 
ctgcatagta ttccatggtg tatatgtgcc acattttctt aatccagtct atcattgatg 26400 
gacatttggg ttggttccaa gtctttgcta ttgtgaatag tgccgcaata aacatacttg 26460 
tgcatgtgtc tttatagtag catgatttat aatcctttgg gtatataccc agtaatggga 26520 
tcgctgggtc aaatggtatt tctagttcta gatccttgag gaatcaccac actgtcttcc 2658 0 
acaatggttg aactagttta cactcccacc aacagtgtaa aaatattcct atttctccac 2664 0 
atcctctcca gcacctgttg tttcctgact ttttaatgat cgccattcta actggtgtga 26700 
gatggtatct cattgtggtt ctgatttgca tttctctgat ggccagtgat gatgagcttt 26760 
ttttcatgtg tctgctggct gcataaatgt cttcttttga gaagtgtctg ttcatatcct 26820 
ttgcccactt tttgatgggg ttgtttgatt ttttcttgta aatttgttta agttctttgt 26880 
agattctgga tattagccct ttgtcagatg ggtagattgc aaaaattttc tcccattctg 26940 
taggttgcct gttcactctg atggtggttt cttttgctgt gcagaagctc tttagtttaa 27000 
ttagatccca tttgtcaatt ttggcttttg ttgccattgc ttttggtgtt ttagacgtga 27060 
agtccttgcc catgcctatg tcctgaatgg tattgcctag gttttcttct agggttttta 27120 
tggttttagg tctaacattt aagtctttaa tccatcttga attaattttt gtataaggtg 27180 
taaggaaggg atccagtttc agctttctat gcatggctag ccgtttttcc cagcaccatt 2724 0 
tattaaatag ggaatgcttt ccccatttct tgtttttgtc aggtttgtca aagatcagat 27300 
ggttgtagat gggtgctatt atttctgagg gctctgttct gttccattga tctatatctc 27360 
tgttttggta ccagtaccat gctgttttgg ttactgtagc cttgtagtat agtttgaagt 27420 
caggtagcat gatgcctcca gctttgttct ttttgcttag gattgtcttg gcaatgcggg 27480 
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ctcttttttg gttctgaaaa ctaagataat tgatttagga actgtgatgg ttaattttat 2754 0 
gtgtcaactt gactgggtta aggggtgccc agatagctgg taaacattat ttctgggtgt 27600 
gtctgtgagg atgtttccag aagagattaa cattggaatt gatagatgga ataaagaaga 27660 
ttgccctcac cagcacgagt gggcattatc caatctttta agggcctgaa tagaacaaaa 27720 
gggtggagga agagcaaata tgctctctct gcttgagctg gaatattctt cttcacttgc 27780 
tctcagacat cagtggtcct ggttcttggg ccctcgaact ggaacttaca ccatcggctc 2784 0 
actcattctc aggcctttgg gtttggactg gaactacatg gctggctttc ctgggcctcc 27 900 
agcttacaga cagcagatct tgggacttct tagcctccat aatcatgtga gccaatccct 27960 
cataatgaac ctctttttat atatctctat atctatctat ctatctatct gtctgtctgt 28 020 
ctgtctgtct atttatctat ctatctctct gtctatctat ctatctatct atctatctat 28080 
ctatctatct atctatctat ctatctatcc tattggttct gtttctctgg agaaccctaa 28140 
ctaattcaag agcattattg ctttctaata taggcattta gtggcatcct atgaattcta 28200 
ttatcttgta ttttcatttt cattcattta aaaatacatt ctaattatct ttttgacttc 28260 
ttctttgact tatggattat ttagaagtgt gttatttagt ttccagatat ctaggcattt 28320 
tccagagatc tttctgttat tgatttctaa tttaaattca ttatagtcat agaatatact 28380 
ttgtatatac tttactaact tgaattcttt taaatttgtt gaaacttgtt ttatggccca 28440 
gaatatggtc tatcttggta aatgtgctgt atgcttttgg aaagaaagta tattcctctc 28500 
ttgttgggtg gagtgctcta taaatatcaa ttaggtcaag ttggttgata gtgttattca 28560 
tatcttctat attcttgctg atcttctctt tgcttattct atcaattatt gaaagatggg 28620 
tcttgaaatc tgcaatttct atttctactt gaagtactgt cagtttttac ttcaggtatt 28680 
ttgaagctct gttattgggt acataagcgt ttagaaatgt tatgtcccct tgatgaattg 28740 
accctttcat cactatgaaa ttaacttctt tacctctgga aattattttt gctttgaaat 28800 
ctactttatc tcatattaat atggacattt cagatttctt ttgattagtg ttagcatggt 28860 
ataacttttt tccatacttt taatctattt ctaactttgg agttaaaatg tttttcttat 28920 
aggcagtaca tagttggctg ttgcttttat atccaatctg gaaatctcaa tcctttattt 28 980 
gggtatttaa gccatttaca tttcatttaa ttattgatac acttaggttt aaatctattg 29040 
ttttgttatt tgttttctat ttgtcccatc tattccttgt tacatttttt tttgtcttgg 29100 
ggattaactg aatgtttttt atgataccat ttcatctcat ttattggctt attagctatg 29160 
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actttttgtg ctgtttagtg attactttta ggtttattat aaacatcttt aacttacaca 29220 
gtctaccttc aagtgatatg taccacatca catatactat aaaaacttta caatagcata 29280 
tacccatttc tccccacaac ctctgtgcta ttgtagtcat atattttaca tttacatatg 29340 
ttacaaacct cacactacat tgttattatt tttgcctaaa caatccacta tctattaaag 294 00 
atattcaaat actttttaaa aaatctgacc tatttacctg tgtagttaat tatttccagt 29460 
gttcttcttt cctttgtgta gactcatatt ttcatttatt attcttcttc catttcttgt 29520 
agcatgggtc tcctagtgat aaattatttg agctttcata taactgaaat gtttttattt 29580 
ggccttcatt tttgaaagat atttttgcct gatacacaat ccctggttaa cagttttctt 2964 0 
ttttctttca gtgttgtaaa gattttgctt catcatcttc tcacttatat tgttcacaat 29700 
gagaaatctt gtatttgtct ctctgtatat aatgtgtctt ttttctctgg ttgcttttaa 29760 
gatattctcc ttatcactgg ttttgagcaa tatgattaca atatgtctcg acataatttt 29820 
cttcatgttt cttgcacttg gcatttgttg agtcccctga atctgtgcgt ttataattct 29880 
catcaaattt gaaaaaattt tgaccattat ttcttcaaac ttttttctgg ccccttttct 29940 
ttctttttct ccatcaggga ttccaattac acatataaga ggccatctga atttttccta 30000 
cagtttactg atgctcaatt cattttgtaa ttctcttttc tatgtgtgtt tcatttcaaa 30060 
tcatttctat ctctgtgtat tcaagttcac taatgttttc ttctgcaatg tctaatcttt 30120 
tattaattct atccaatgtg ttttttacct cacatattgt ggttgtcatt accataagtt 30180 
tgatttggct attttcaaaa atatcttcca gccaggcacg gtggctcacg cctgtaatcc 30240 
cagcactttg ggaggccgag atgggtgaat cacgaggtca ggagatcgag accatccgta 30300 
ccgaaaatac aaaaaaatta gccaggcata gtggcgggca cctgtagtcc cagctatttg 30360 
ggaggctgaa gcaggagaat ggcgggaacc tgggaggcag agcttgcagt gagccgagat 30420 
ggcgccactg cactccagac taggcgactg agcgagactc tgtctcaaaa aaaaaaaaaa 30480 
aaacttccat gtctccacct aactttttga acacacagaa tacagttagt tgtaattaac 3054 0 
ttttttatca ttgtctgcta attctaacat ctgtgtcagt tctggtttga tttcaattaa 30600 
ttgatttatc tccttcttat tagtcacagt ttcctgctct ttccatgcta gggaattttt 30660 
tacaggatgc cagacattgt gtattctacc ttgttggttg tttatatgtc taagtttcta 30720 
taatcagtgg ttcccatcta aaggcagttt tgtttgtctg aagacattct ttgttgttac 30780 
aactggaggg tgctattggt acccaatggg tacaggccag gtatgatggt aaacgtcctg 30840 
caatacacaa gacagccccc acaataaaga attatgaagc ccaaatatgt caatcattct 30900 
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gaggttggga aaccttgctg taattattct 
gttggaaatt gtttgatcct tttggggctt 
gagtattaag tttggggcta atcatttcac 
tacttgtctt agttcaatca ggttgctata 
caacagaaat ttatttctca cagttctgga 
tcatctagtt cttgtgaggg cccactttct 
cttcagatga cagagagaga gagaatgaca 
ctatgtctcc ttctaagggc attaatctca 
acctcccaaa ggccccactt cctaacactg 
aatttggggg gcgaggggta tataaagact 
ttagctggtt gagagcaggc actattccca 
taactatttg ggatagttcc ttttcaaaca 
tctacattgt gtttgttcca tccccacttc 
aagatgctta acactcatca tatccactct 
gtatctccag taaatgttgc cagtccctta 
ctccctcctc atcaccccaa tattcaatca 
aatgtcccct gcaccctgtc cttactttcc 
catcatgtcc tgctttggcc cttcatgtgg 
tccccctcca ctatacaccc tcacttccag 
aggccgtgcc gtttcacaac tctatgtctt 
agtttttctg ttttctgcct ggcaaattac 
caagcccttt ttgatttctc gaggcaccta 
gttctgttca cactgtgtta acaacattgg 
ttgctatagc tcctagggat ctaacaatcc 
acaacttcca aaaaactcat atgctgaagc 
taggctcttc agcaggtaat taaggtcaaa 
caggactgca gccttataag aagaagagat 
catagcaaga aggcagccat cttacaagcc 



tgagttttgt tccgggactc agttaagttt 30960 
tctgttaagc tttgttaaat gagaccagaa 31020 
catactgaag caagaacctt ctgtgtactc 31080 
acaaaatacc aaaactgggt aacttataaa 31140 
a ?jctgggagt ttgaagccag gacaccatca 31200 
ggtttgcaga ttgctatctt cttgttgtac 31260 
gagagagaga gaaaggaatg aagctctctc 31320 
taattggggc tccatcgtag tgacctagta 31380 
tcaccttggg ggttaggatt tcaacatatg 31440 
cagactgcag cactacccaa tactctaatt 31500 
gccctgtgtg aattctgagc actgttcttt 31560 
ttaaatagtt tcctcacgtg agctagtgaa 31620 
actgctagta cccaacctca gaactttcta 31680 
cacccccacc ccagacccac tcctcctcct 31740 
agttagaaac tcaggagttg ccctccactc 31800 
gttatcaaat gctgtcaatt tgaacttgcc 31860 
agcctcacta tccttgcctt cttagaccat 31920 
caaccccaac ttgcatctcc aacatccccc 31980 
tgctacagaa cctttcattg cttatgaagc 32040 
tgcacatgct attccctttg cctgaaatat 32100 
ttctcatacc ttagatgtca ccttctctgt 32160 
ttcacccctg ccccttcgtg ctcctgctat 32220 
tcacatctgt ttgcactgct gctttcccca 32280 
ctgtaggtgc ttgctatgga ctgacttctg 32340 
cctaaccccc aatatcacta tacctggaga 32400 
tgaggtcata agggtggagc cctaagcaga 32460 
ctttctctct cttcctccgc cacgtgaaga 32520 
aggaagagaa ccctcaccag aacttcacca 32580 
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tgctggcaca ctgattttgg acttccagcc ccaaaaagtg agaaaattaa tttctattgt 32640 
tgaagtcaca cagtctctgg tatttttatt atggcagcca ggctgactac tacagtgctt 32700 
aatgcatggt gagtgcattt ttaaatgaat ggaaaaggaa gaaagaaaca gaagaggaag 32760 
aggtggaaca aatggggaga aagatgaaga aagcagaagt taagaatgag agaaactaag 32820 
gaaacaggag gaagaaggta gaggatggga atgagctcac gaaggagcca gcatctcagt 32880 
ggagaccatg ccagtgtgag ggaactactg ccaactagga ggcaggtggg tgcaggctcc 32940 
agggccagtg ctgagcatcc aggcacaaga cttttaagag agagtgtgtg cgtgtgtgtg 33000 
tgtgtgtgtg tgtgtgtgtg tgtgtgtaat tttttcccct agagatagcc tagaggggcc 33060 
acctttgtgc ttttctcaga ggctatgtgt cctcattggc tgggtaacag ctgggtgccc 33120 
ttgcctcccc ggccaaaggc agccgccgtc cacacccagc tgcctgaggg acaaggaggg 33180 
tgtttaccca atggcctgtg tccttaaagg actctgctct cagttacaca gcaaacagct 33240 
ggcctcgaag gggcctcgtc ctggccgccg cgagcgggcg gcttctgcag cgtttaagct 33300 
tctatctacc atgaagatgc ttcccccacc ctcatgcctc ctcccggggg cgggcatgag 33360 
gggcacttac gagctgagac ctcacaggcg acatttttta agctgctcat tctgacggaa 33420 
cagtccccaa gacaaacaca aggctttctg tctcccaggg cttgtgcgac ttggtttcag 33480 
actctgccac ccactgccgc agacagcggc cgtcttccca ggaagaagac atttcctggc 33540 
ggggtaccgg gactgacggg cggcggcgca gggcagtgcc agccgggccc aggaagtgaa 33600 
cgcgctgggc ccgcgaagga gcgcgtcgcc cgccctgagg ggggcagtgc agcgcccacc 33660 
gcgcgccttc gccagccccc tggcaccgtt gccctggcca ggggccggga gagggcgtgg 33720 
cagcgcggca gtgcgcctac gcaccggggg cactggggat cccgccccca cgctggttcc 33780 
cgcgctttca agagcctcag agccccgaga caagggggtg tcggctctac ccacctggcc 33840 
cagcccgaaa gaacagggcc aacattttga aaccaaacca ataatagaat tggaatactt 33900 
ttgatcactt tttattttcc ttttctggag gagagttgtg gggcgggggt ggggagctgg 33960 
tagagagaat ctcagggcgc tgtggctttg agggcgggtt ccctgcgcaa gcagcagtgc 34020 
tcacagtgcg agcaatcatg gctaggccca gcccgcggtt agttgggtgc agggacgtga 34080 
caggaccgcg agggctgatt ctttagtcta ttgttcccac tcctcagaaa ggactcccag 34140 
caaatatata ctgctagtcc cagctttccc gatggaatgg aaccccttgc tttgctttgc 34200 
tgccgctatt caggatgggg attttgtcca tgcagaaaga catttacact atctacactg 34260 
gaatgccggc ccactagcct ggcaaaatgc tgagccagct attagatctg gactcgtgga 34320 
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ggagtggaga tttgcctcca ttcaagcaac cacatacttc cctggaagct gggaggagca 34380 
ccactaagtc caccctctcc tgcctggctc cttggtagtg cttctcaaac attatcactt 34440 
ggacaaatca cctgaggtcc ttgttaaaag tctgatttag aaaatctggg gaatccggtg 34500 
gcacccgaga ctgcattttt aacaagctcc caggtgatgc agctactgct gtgcaccaca 34560 
ccttgagaag caaggggtac aacattgctc tcaactgcgg ctgcacatta aaattaactt 34620 
ggggagcttt aagaactaat taggccagga cccccacccc caagatattc tgatttagtt 34680 
ggtctaggta gagtcaggcc tttttttttt ttttttgacg gggtctctct ctgtcgcccc 34740 
tgctggagtg gcgggatcat ggttcactgc agtttcaaac tcctgggccc aagctatctt 34800 
cccacctcag cctcccaagt agctgggaca acaggcgcaa gccaccatgc ccagctaatt 34860 
ttttttattt tttgtagaga cagggtctcc ctatgttgcc caggctggtc tcgaactcct 34920 
gggctcaaga gagcctcctg cctcaacctc ccaaagttcc gggattacaa gcgtgaggca 34980 
gccaccatgc ctgatctggg cttttttttt tttttttttt tttttaatta agctttttgg 35040 
gtgattataa tgggcagcag gattgagagc taatggtaca agcagctatg aggaaggact 35100 
ttggctgttg ccaggcctat tgatttcata ctgggcccaa ctgtagggtc cagtttatga 35160 
ccaagattag gaattcagtt tgtgatcaaa gaacataatt tgtccatctg gcttcaaaca 35220 
cttcatctta cctcacattc attcaatcat caaatattta ttgagtcact gtattgggca 3528 0 
ctgtgctagc tactaaggat tcagcagtga gtcagacaga catgggaagg tggtgacatg 35340 
tttaagcagg caggtgacat gatcagattt gcattttaaa atgatcagtc tggctgcaga 35400 
gtgaacagat tggagggggg ccaagtggct gtgtacagaa gcccttgcat taaaccaggg 35460 
gagaaatggc agtgtaggcc agggtggtgg cagggaggag cggaggaact caagggctac 35520 
tgaagaggtg aaatcaacag gacttgataa tagatttgac agggggaaat gtgagagagg 35580 
gggactgtca aggataactg ctaagtttca gatttaggca actgaatgaa tgggggcacc 35640 
gcagagaaaa agaggcctgg aagaggacag tatttggaag gaaggtcata tgtttggtcc 35700 
aagtggagct gtcaagtatg cagttggata agctgtctga aacttagagg agagatctga 35760 
gctacagatg taaatttggg agccatgagg atagggggtg tgaatcaatc acctggataa 35820 
agattgtaga atggcaaaaa caaacaaaca aaaaaacaac ttagggtttg aagaactcta 35 880 
tcatttactg gcttgctctt ccaaggtccc tactattggg caatgctaac ctttccgctt 3594 0 
cctaatcact gtcatttttt aaatctcatt cattaaagac ttttggtaaa agagttcatg 36 000 
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gttttctcct cttccagaaa ccctgacatt ctcatggatg acttccacat ccatgcagac 36060 
accccctcca atgccatgtt ctctctattc tttgacttgc tcatctttac ctcccgcacc 36120 
cctcctcaag ttacactctg gcacagccac accttgggcc ttgacattgc tttcagctgt 36180 
gccactttaa aaatcactct ttcaagcctc cccgctctgc taccttctca tcttccagtt 36240 
ttcttgtgtg attattcctg tgagaactgt tcttcaactt tattgggatg gccagcccac 36300 
cgacccctct acttcttccc actctaccag ctagctctct gctttcctga cttcattatc 36360 
tagcctaaag ccttccgttt gtcaatccaa taccaaagat tggcagaggc tgccatgctc 36420 
ccaacagatc ctggataaga ggtggggagg ctgaggtctt ctgcgagtgt gcagagggta 36480 
ttaaagttgt atgctgtgga ttaaaattta gatgtgggtg ttctgggtga cataggaaga 36540 
agaacctcta ttttgttttg aaagtgggcg tacctgcctt tcattcttgc ctggacaggc 36600 
cacagcccac ctcaagctac taaagtgcct gggggccaga ggcaacatgt tccttaaaga 36660 
agcagctgtg ggtcagctca agtgtttggg caagtgttca gataaagctc cagaatacag 36720 
ggtcttggag cataagaaca tgatattagc ctagactgtt ggaagggtag gtgtccctgg 36780 
tcatttaccg ttagtgacct tagatctgat gtttgagtag cacggacttg gggctagggc 36840 
cgaagctggg cccacattac agccagagga aatagcagag ataaagcaaa ggaggtggga 36900 
tgtgtacagc actcccagcc aacagtgaac aatctaattg gctggtacag ggtgtctggg 36960 
aaggagcttg agattccatt gcccatccgc atcctacact tggccttttg ccatttgaga 37020 
aactaattaa tagacacata ttgtgataga gagtgagatt gttggggatt ggagggtccc 37080 
cacccattcc aggaatgact ggaaagggcc tgcactgacc gacttaggct cttcttggaa 37140 
gatgtcacgc attctaggag aattgtgacc aattgcaacc agacacactc ttggtcctct 37200 
tggtcctttt ttgtgtcatg gaccctttag caatctgttg aagcctttgg tccccttttc 37260 
agaataatat ttttagacgc ataaaataca aaggatttaa atggaaacca agtatagtga 37320 
aatatagcta tcaacctttt tttttttttt gagatggggt ctcactctgt ctcccaggct 37380 
ggagtgctgt ggcacaatca tagctcactg taacctcaaa ctcctgggct caagtgatcc 37440 
tcccacctag gcctcccaaa gtgctgggat tgtaggcatg agccactgca cccggtctcc 37500 
aacatatttt ttaaaatttg tgatgtctgt gcttcattat tagcacatta tataactaaa 37560 
tctagctgtg agtcttataa ctatcacaat ttcaaagcaa tgatgaacat attgctattt 37620 
caagagatgt gcttcattct aatgtgatat gaaaatatgt gtaatttctg ctggtggcca 37680 
agtcacaggt cctctaatac tatggtggtt tatcacctgc attcatgatt gaaggaaatg 37740 
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ctaaaatata gttaaaagat agcagaaaca 
aatcctctga attccagcca cagactactt 
acatagcttc ctggtgttcc tttgtggaaa 
ataacaaagg cctgcagact ggttggctta 
tggaggctgg gaagtcgaag atcaaggcac 
ttccttcgta gatggcacct cctagctgta 
ctcttgggcc tcttttataa ggttactaat 
taatcacctt ccgaaggccc agcctcctaa 
atatgaattt gggaggacac aaaaactcag 
gctggagggc tcttttccag gatcatggtt 
cctggccttc cctccactgc actcttagag 
gtagtacttt ccacctcaag tagggttcct 
aagtacaggg ctctgtgcta agccctgggc 
tcagtaataa taactcccat ttaccaaaca 
ttcattcatt cattcagcag agatttacca 
ctgtaggtag agcagagact aagacaaact 
ggggaagaca gatgatatac aaaataagta 
gctatggaga aaaacaaagc agagaaggag 
tgtccgatta gttgaccagg gaaggtctcc 
gaatgatgcc actgaagctt tgcagatatt 
agctaggcac ggaccctgag gcaggaatgt 
ccgtggggct ggagaagagt gagtgagaga 
atgggggtgg gggcggtgac agattgagca 
tcttacccag agggagatgg gaagcccatt 
gctgttgcgt gatagtatta gtattctcat 
gttgagtgac ttggctggtg agtggcagag 
tcacatgcct cctccctcca ccagcccgag 
cccttctatt tcctccagac tgttgttcct 



aagatgtagt tttccccatc taagttcatg 378 0 0 
agaggaacag ggaccccaag ttaaagatcc 37860 
ttccctttat cttagtccat tggggctgct 37920 
taaacagtag acatttattt cttacagttc 37980 
tagcagattt ggtgtccaac aagggcccag 38 040 
tcctcacaag gtggaagggc caagggtact 38100 
accattcatg agggctctgc ccccatgacc 3816 0 
tatcatcatg ttaggggtta ggatatcaac 38220 
atcatagcac cttttgtctc aagtctcctt 38280 
ggcgatttgg tttttagaaa caggctgagg 3834 0 
cctgggttgt ttgggtttct cttgtggcag 38400 
ctttcagcta atgtcaactg aacatctgct 38460 
ataggaagtt actccataaa gagatagtta 38520 
ctttttacat gtcagacacc atgttgacac 38580 
agtgcctact atgtgccact gttctagaag 38640 
cccagctcac atgaagctta cactctagtg 38700 
agcaagtgag tgtgcaaaaa taggataagt 38760 
gagaaggaat gcccagagag atgaatgcaa 3 8 820 
cggaggaggt ggacatttca gactaggcct 38880 
tgaaggaaga gcattctagg cagagaaaat 38 940 
gcttagctgt ttgaggcatg tggaggaggc 39000 
aggagtggta actacgagag cagagaggaa 39060 
ggaccttgga ggtcttatgg tgaccttggc 3 9120 
gaagggattg aacacagtcc ccacaacaat 3 9180 
tttcaaatga ggaaacgaca gctcagaaaa 3924 0 
ctgaaactgg accacagacc tttctggagg 39300 
gctcggagtg gggtcagcat gttctctact 39360 
ttgtcatttc cccaaggacc atacagtgct 3 9420 
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ctctgtccac cattcttgct gtatccacca acctctgcta ctggactatg agccccttac 3 9480 
ttactgagca cctgctatgt gccaggtact atgttagacc ctgagcataa aatgatgaat 39540 
aagagaagtc agctccttgc tctcctggag cttttattct agtggatggg aaacacaggt 39600 
aggtcagaaa attttaaaaa gctttctttt tctttttttt tttttagaga cagggtctca 39660 
ctctgtcaca catgctggag tacagcagcg tgatcgtaac tcactggact caagttattc 39720 
tcccgcttca gcctcccagg cgtgtgccac cacacctagc tttgtgtgtg tgtgtgtgtg 3 9780 
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tagagagaga cggggtctta ctatgttgcc 39840 
cagtctggtc taaaactggg ctcaaatgat cctcctgcct cagcctccca aagcactggg 39900 
attataggca tgagccacca caaccgccaa aaatctttag taataagagc catgcagaga 39960 
aaagagagta gctattttgt aaagaggtag ggaaggcctc tgtgaggagg tgacatttaa 40020 
attgagatct ggatcataag aaggaaccat ctcagaggga gagcattcca ggcaggaggt 40080 
atgagagcaa aggctcaaag gtggtcatga gtttggcctg ttagaggaac aggaagaaga 40140 
tgcatggcaa gggatgggaa catggcagga gatcaaggta gaaatgcagg ggccagatca 40200 
ggcagggcct tctaggtcat gaacatatgg gaatccactg gagtgtttca agcaggagaa 40260 
ggaaatgatc tgattggtgt tttaagatca tgctgacttc cttatggaga gtggattggt 40320 
gggcagaggg atggtggaga aaagtgtaaa gattaaggat atgtttggtg atgaagtcaa 40380 
ccagatagcc tgatagatta gatgtcatgg atgaggtaaa gagaggaatc aaagatggct 40440 
tctatgtggg ggttgaacaa ctgcatggat ggtgtctgtt caggctgcta catcaaaata 40500 
ccttagattg ggtggctcat aaacaacaga acttttttct tatagttctt gaatgtgagt 40560 
caaagatcaa ggcaccagca gattcagtat ctgatgaggg ctcaccctct acttcataga 40620 
tggcaccttc ttgctgtgtc ttcacatggc agaaggggca aacaagctcc ctgaggcctc 40680 
ttctgtatgg gcactaatcc ccttcacctg ggctgtctcc taaagtcccc accccttaat 40740 
accactgcac tagggattag gtttcaacat atgaattttg ggaggagaag gggggccaac 40800 
attcagacca taacagatgg ggatgccatt tacagaactg aggaaggcat tgggggagtg 40860 
aatttggagg aaacgttgag ttgttttgaa catgtcagtt tgagatacct actaaacatc 40920 
cacatagtag tgtcaaatag acagttggat ttatgagtct gcagttctta agagtagtct 40980 
aatctaaaga tatagagtca gaaatattgt cattgtgcat acccacctag gccgagcaca 41040 
gtgcctggca tatagcagcc agcctaaaat ttgggtgggt tggttgaatg aatgaatgaa 41100 
attcttagag cttcagaatg agatagatct tggttcaaat ttaaatcctg gctctgctac 41160 
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ttactaattt aaccttgaat aaggtattaa acctttcaga gcctcaattt cctcacctgc 41220 
aaaatggaga tagtgataat acttatccca tagggtggtt gtaaggatta tactaaataa 41280 
accatgtaaa tcacccaagg tgcttagtac agtccctggt gcatagtaag cactcagtaa 41340 
attggtggct gttacaatat tcaaaatatg agataggccc caagaaagca agccttggag 41400 
agacactcca gagggcagcc ccacagggtg ggacaatgta gctcagctat atgtgtgaag 41460 
cagtaatgcc tagtgggtag gtgttgctcc cttgagccaa actgtctggg tttataatcc 41520 
tgactcagcc acttagtagc ttacttaacc tctatgtgcc tcaggggggt ttagaaatta 41580 
ttattattta atctctaaaa tagggataat aatagaccta cctcataggg ctgttgtgaa 41640 
ggattaagtg agataatcca tgaaaaaggc tgagaacagt gcctggttca tattaagggc 41700 
tcaatcaatg ttaggagcta ttatcacagg gatgtaggag ggagggggat gcaaacaggc 4176 0 
ttcagctgtg cccttgacct ccctccagag aagacagtat tcagccctgc caggcagcag 41820 
agcccgtttc atgcagctct tctctcccag cactctcctt cctactcctc cccggctgtg 41880 
gcctcccagt ttttgcaact ccctgaatag cttatgaacc tttgccaggg ttggtgtgga 41940 
aaaggccaac tcctccagta gagagggctg cagaagctca gcctcccttc acactggggg 42000 
aagtgttagc cacaggctac aggcaaaggg aatccagatc cccaagctgc ctgaagttat 42060 
ctagaagcag agagaaataa ccaactgaga cagtcccaga gggcccccca agcttctttt 42120 
ccatttaaat gctctctttg ttactaagaa gcaaacctag cccatcccca ctggcaccag 42180 
cctgcccatt gtttccctaa gagaccatcc caagccaagt gctggggcta cctctccctt 4224 0 
aaaggaatgg cagcctggag gttgcactgc agtaagagca tcatataacc aaaattctcc 42300 
acaactaatg gttctagaac aggacccagg tcaggtgtca tgggcagacc cataaagggc 42360 
ccaatcacta gagcatttga cctgcctctt ccccccaggc ttggctacct gtggtgatta 42420 
atgctctgag gcctctgaca tctgagcaat ggacacaagg ctagagccta atcccaccta 42480 
catttgatca aaatggtagg ccttgtgggt ttgttccgga ggtgagaaag cattaaatat 42540 
tccactggaa agacacaatg gagagattgc gtaagtagac ggttctttgg gcagaagaag 42600 
atggatggat ctgaagtggg aagtgccaga agaacccttt ctctaaccag gaggggcctc 42660 
tggtatccct gcagctttta gctctcagtg tgatcagtag actgtctgct tgacttaccc 42720 
aaactcccac ttcccctcct gtcaggtatt tgcttggggg ctattcatct cctctgtgtt 42780 
caccttccat tcagctgaaa aagaggaaga ggatttccca gctctgtctt catccatgca 42840 
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tttttttttt ttgagacaga gtcttgcttt gttgccccgg ctggagtgca gtagcacgat 42900 
ctcggctcac tgcaacctcc acctcccagg ttcaagagat tctcctccat ttcaccatgt 42960 
tggcctcagg tggtcctccc gccttggcct gccaaagtgg tgggattaca ggtgtgagcc 43020 
accgtgccca gcccttattg gagttttttt tgtttggttt ttgtttttgt tttttgtttt 43 080 
ttgttttgtt ttgtttgaga cggagtctcg ctctgtcacc cgggctggag tgcagtggtg 4314 0 
ccatctcagc tcactgcaag ctccgcctcc cgggttcacg ccattcttct gcctcagcct 43200 
ctcgagtagc tgggactaca ggtgcccacc actgcgcccg gctagttttt tgtattttta 43260 
gtagagacgg ggtttcactg tggtctcgat ctcctgacct cgtgatctgc ccgcctcagc 43 320 
ctcccaaagt gctgggatta caggcatgcc ttattggagt tttatgagca tattctgtga 43380 
tgttgaatgt ctagtctata aatgattgcc tagagaagac agggggctta aggaaccttt 43440 
gaacgccccc actggttttg gagaggtggt gagacttttg gactcaaata atggctactg 43500 
tgatgttaac tgtgtttatt gatttacaga agctacgaat ggtgcagtgt acaaatcacc 43560 
ccgaaccttg gtggcttaag acatcagcaa tcatgttatt acctctcatg gtttctgtgg 43620 
gtaaggaact caggaagggc tcaggtggta gtcctggctc caggtccctc atcaggttgt 43680 
aatcagggag tgactggagc taaaacagta tggggtgagg agaatggggc tggagcagct 43740 
ggggacaggc cagacacctc tcacttcatg tggtctcgga gatcttcaca tgatctctcc 43800 
atgtgagtcg gtttgggctt cctcacagca tggtagcagt caaaccaatc acagggtggc 43860 
tcatggcttc aagtgtgatt attccagcaa gcaaagtgga agctgtatcc tcttctacga 43 920 
cctcgccttg gaagtcatct agcttcactt ctgccatagt cacaaagctg cccagcttca 43 980 
gggagaagga acatagaccc tacatctcaa tgggggagtg tcaaagtcat actgtaatac 4404 0 
gctcatgcgg aacagacgat actgtcacag caattgtggg aaaatactcc ctactacacc 44100 
aatcctactt cctgctcttc cctcagctga acctcagaca agccttttgc tttcttgctt 44160 
gcagctgtca gctggtccct cactctgggt gcccttctct cctctccagt gccttcaggc 44220 
cccaattgga gtcccacctc ttaaagtttc cctaacggct tccataagca caacaagcag 44280 
cagttttccc ttattcatag caagcagaca cttacatgtt tgataggcca aagagttgct 44340 
ccacatttgg tcagggccaa tctgaccagt gggtaggttt ccacatctgg gccctaatac 44400 
ctggccaggt ggaacatcta gattctgaaa catcctcctc attcttccag ggtctcagaa 44460 
cctcagggtg acatagcttc tggccataaa tgctaggaca gagctatgtt cattgtgcaa 44 520 
tgtctcttcc aaggtaggca catgggcaga tgaggctctg tgcagaaggc cttggcagtc 44 580 
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agatgagagg tggcagtaaa atgtgttggc aagagccttt ggctgggaat caggaagcca 44640 
gggtccccga accagctttc tggctaattt gatgcattca tgatctgaga caaacctccc 44 700 
aatctccagt tgacctcagt ttccctacct ctgcagtata aaacctagag cttgtccttc 44760 
ctacctgtgt ggtgagcaga caagaagaga gacagggctg ggcatggtga ctcacgcctg 44820 
taatcccagc actttgggag gccaaggtgg gtggattacc tgaggtcagg agttcaagac 44880 
cagcctagcc aacatggcga aacccttccc tacaaaaaaa atacaaaaat tagctgggcg 44 940 
cggtggcaga tgcctgtaat cccagctact tgggaggctg aggcaggaga atcactggaa 45000 
cccgggaagc agagattgca gtgagccgag attgcaccat tgcactccag tctatgagac 45060 
agagcaggac tccgtctaaa aaaagaaaaa agaagagaga caggaagaac ttctcaagag 45120 
tagaacatgc taataattct tgtactgtat ttgtttcttc tctcagccgc cacttcctct 45180 
ccagcatgta ggctccaagg tagtgggtga gtccgtaggc gcatgattat ggaacaggga 45240 
tgggccaaca gcctttccct atacacatac agtaggattc aggcaggggg gtttggcaat 4 5300 
gtggccattc cccagttgct gaagggctta ccctgggtgg atttggtgag gctgagcaaa 45360 
tggttaggca cagagctaac aattccaccc tactctgctc caacacacac cccccgcccc 45420 
cattagaatg ttctttatta ggctactaat tggctgtgtg gcctcagggg gattccttcc 45480 
cttctctggg gcttagtttc cattgaagaa agtgaacata tcaccacccc tgcttctgga 4 5540 
gagaagcttg attatttctt gacccccatc atttctattg tgcaagggtg gcatgtgatc 4 5600 
tctccagggc tgggaagcat taagtcagaa tagctgtcat caaaacaata catgctttcc 45660 
tcaaaataaa gggggctccc gtaggcacct tcacctcttg ggacttaaag gctggctgag 4 5720 
gcagagtgtg ggcccagaag ttagcaaatc tggacgtgtg aagtgacctt ggctaagtca 4 578 0 
gagaatccca aacgcgcaga gctcaaaggc actgccaaaa acctcaagcc catcagctgt 45840 
cagcctccag aataaatgtg tcccagatga cgttcacatg gccacgtcag agtttggggc 45900 
aattctccgt cagctgaacc ttcatccctc ttccttccac tctggaaaac atatcacttc 45 960 
tctcgacttc tgagaagaac ttgcacatgt gcacaaaggc tggccaccgc agcactattt 46020 
gtaatacaga aaactcagaa caacctaaac gtctcccacc agggggctga ttcaatcaat 46 080 
tatgggacag ccatattaca gaatgctgtg cagatatctt tttatgtatg agacataaat 4614 0 
gtgtggataa agttatccaa gacatatttt tattttattt taattggcaa ataataattg 46200 
tatttgtcta tggagtacaa tgtgatgttt tgatgtacag ctggccctac ttattcgtgg 46260 
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gttctacatc caaggattca accaagcttg gatgaaaaac atttgggaaa aaataatgga 46320 
tgttgtgtct gtactgaaca cgtacaggct ttttttcttg tcaggattcc ctgaacaata 46380 
taacaactat ttacatgcca cttacattgt attaggtgtt atacacaatc tagagatgat 4644 0 
ttgaaatata aggcaggatg tgcataggtt atatgcgaat actatgccat tttatatcag 46500 
ggacttgagc atcctcagat tttggtaccc gcaagcagtc ttgggcccgc tatatgtata 46560 
cattgtacaa agattaaata aagttaatta gcatatctat cacctcccat atttatcagt 46620 
tttttgtggt gaaaagcata ttgttaagtg gcaaaaaaaa aaaaaaaaaa gctgtggggc 46680 
tgggcctggt ggctcacgtc tataatccca gcactttggg aggccgaggc aggcagatca 46740 
cctgaggtca ggagttcgag agcagcctga ccaacatggc aaaaccctgt ctctactaaa 46800 
aatacaaaaa ttagccagtg tggtggctcg tgcgtgtagt cccagctact cggggaggct 46860 
gaggcaggag aatctcttga acccaagagg cagaggttgc agtgagccga gatcacgcca 4 6920 
ctgcactcca gcctgagcga cagagtgaga ctctgtctca aaaaacaaaa acaaaaacaa 46980 
aaaacaagct gtgggacaat atgtatgata caatatatgt gataaaatgt acacaactat 47040 
atatttctat aggtactgta catatacatg taatgtatgc acggtatgca actgcaagga 47100 
cagaagtgta ggatacccca aaccctaaca gtgttcaccc ccagtaaagg tactaaaatg 47160 
agaaacgtgt cttaaggagt tctttccaaa taactaaata ctttggattt ttgccaacgt 47220 
gagttgattc atcaatttat tttctaatta aaaatgcact ttcacagtaa agcatatgaa 47280 
aaggggcaca gaaacagatt agtggctgcc aagggttagg gatggcttgg agggaacagg 47340 
tgggcatggg tgggtaggag taacacgagg gagtttattg atggtggtag ttatgcaaaa 47400 
ctatacacgc aatgaaactg cataaaactg tgcacacaaa catgcacaca caaatgggcg 47460 
catgaaaaag cttgagaaat ctgaatcagg tctggagttc acttagtggc attgtactga 47520 
tgttatatcc ctggccttga cattgtacta tagttatgta agatgttacc agcggggaaa 47580 
gctgcctgaa gaggacatgg gacctccctg tattattttt gcaacttctc gtgagtagtt 47640 
ctttgaaaat aaaaagttta ctaaaagggg ggcacaaaaa gctttttgaa tatttggagg 47700 
tgatggaagt gtt teat ate ttgaatgtgg tagtggttac atgactgttt acctttgttt 47760 
acctttgtta aatctcattg aacttgtaca catcaaaggg tgaaattatg ccttcataaa 47820 
cctgacttta aaaagtctca ggecagaact gctctcattt gttaagatga agaaatcatt 47880 
tacaagcttg tgtgctttgt tgtttatttc aaaattacat gcaacacact tcacaatgac 47 94 0 
cagtgatgee tgtgcccagt gggaggtccc cgttagatgc tcagccacct ecagegatga 48000 
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ggagcccatc acttaatatt gcagcccata tcattgtcag gagaattgta atagaaagtt 48 060 
cttgattagg cctgtaatcc cagcactttg ggaggcctag gcggacggat catttgaggt 48120 
caagagtttg aaaccagcct gcccaacatg gcaaaacccc ttctctacta aaaatacaaa 48180 
aattagcagg gcgtgttggt gcatgcctgt aattccagct actgggggtg ctgaggcacc 48240 
agaattgatt gaacccagga ggcggaggtt gcagtgagcc aagatcacac cactgcactc 48300 
cagcctgggc aacagagtga gaccctgtct caaaaaaaaa aaaaaaaaaa aagaaaagaa 48360 
agaaagaaag aaaagaaaaa ataaagttat ttattaggct actagttagc tgtgtggcct 48420 
tgggtagatt acttcctgtc tctgggcctt aatttcctta taagaaaaga tgacctctga 48480 
gatcccttct cgctctaaca ttctaagtct ctaaaactgg gaatttaagt tttgacctgg 48540 
ttttgccttc tggagtcaca cagaacttaa ccctgtgtgt tgccaacagt cagccagccc 4 8600 
atcgtacact agaagtcagc tagcagctca gctccctagg gcaaacatca tcccctggcc 48660 
tttcccattg tcctaccaag aaatggaatt gcaggatctg gtgatggtta ggatcacagg 48720 
atcaggagtc aagcagcctg gagtctgact gtggactttg ccacttccca ccgtgtgacc 48780 
ttaggcaggt cacttgtgca atcttcagtg tcctatctgt agaaagggaa ttagtagtac 48840 
cagcctcctg gggagagggc gtgggcatta aatgaggtaa tgtgtgtaaa tcaggatgtg 48 900 
tggtaggcag gataatggac tccccaatat ctgcacat cc taatccccac aacctgtgaa 48960 
tatattgtgt tacagtggca agaggaaatt aaggctgcta atcagctgac tttaaaatac 4 9020 
ggagatcagc ggggcatggt ggctcatgcc tataatccca gcacttcggg aggccaaggc 4 9 080 
aggtggatca cttgaggcca ggagttccag accagcttgg acaacatggc aaaaccttgt 49140 
ctctactaaa aatacaaaaa attagtcagg tgtggtggta tgcacctgta atcccagcta 49200 
cccgggaggc tgaggcaaga gaattgctcg aacccaggag gcagaggctg aagtgagccg 49260 
agatcacacc attgcactcc agcctgggca acagagcgaa tctccatctc aaaataaaat 49320 
aaaatagaga gattggggaa taatgatctt ggattatcca gagaggccca atatgatcac 49380 
gagggtccct ataagtgaaa gacagaggca ggaaagtcag actcagaatg ctgcaatgtg 4944 0 
agaaagtctc caccagccat tgctggcttt gaagatggaa aggcatcgtg aggcaaagca 49500 
tgcaggcagc ctctcgaagc tgaacaaggc agggaaacag atcctcccct tgagcctcta 49560 
ggaggaacgc agacttgctg accccttgat tttagcccag tgggactcct gttgaacttc 4 9620 
tggcctacag aactgtaaaa gagtacattt gtggtttggt ttgtttgtgg cgtcttttag 49680 

65 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 991 1 799A2_I_> 



WG99/11799 PCT/US98/18426 

gggattgttt agttttgttc ttgtctttgt agagacaggg ccttgctatg ctgcccagac 4974 0 
ttgccgcaaa ctcctggcct caagtcatcc ttccacctca gccttccaag cagctaggac 49800 
tacaggcaca caccaccacg cctggctaat tgtttttaaa ttgttctgtg gaggcggggc 49860 
acagtggctc acagctgtaa tcccagcact ttgggaggcc aaagcgggcg gatcacttga 49920 
gcccaggagt tcaagaccag cctggccaac atagcaaaac cctgtctcta ctgaaaaata 49980 

50000 

caaaaattag ccgggcctgg 

<210> 5 
<211> 44453 
<212> DNA 

<213> Homo sapiens 
<400> 5 

tggtgcatgc ctgtagtccc caccgctgag gaggctgagg caggagaatc acttgaaccc bu 
aggagtcaga ggttgcagta atctgagatc aggcgactgc actccagcct gggtgacaga 12 0 
gcacgactct gtctcaaaaa aaaaattctt ttgtggagat gaggtccttg ctatgttgcc 18 0 
cagactggtc atgaactcct gggttcgagt gatcctcctg cctcagcgtc ccaaagctct 24 0 
gggattacag gcatgagcca ccatgcctgg cccatttgcg ttgttttaag ccactaaatg 300 
tgaggcaatg tgttgcagtg gcaataggaa actagtacag atggctgctg ctccctcctg 360 
accggtctcc ctgcacccac cctcgcatct ccccagtcca tcccccacaa cacagccaaa 420 
gcaatgttcc caggacctta gtcaaactaa gtcacatccc ttaatacaac tcgtctcccc 480 
gtaccttaaa tgcgcctcag ggcctttgaa cctgctgttc tctttgttta aaagatgtca 540 
tggtcgccta agtaatacct aatctacttt cataaactag ctcaaagatg aaaatgctgc 600 
ttcctcaagg aaggcttgag gaacccatct gggtcaggtt ccctaggtaa tagcaccctg 660 
tcctcctctg tagcacttgt aggtttttaa ataaatccat gtgttgatct gcttcagggc 720 
ttattgcctg cctttgctac caggcacgta agcttcccaa gggcagggac agtgtttgtc 780 
ttattcacct cccttagccc agtcgctggc acaaaggaag cacttaataa atggtagcta 84 0 
ttatcattat tttatttctg aatcctcacc attctgatca cctccccaaa cactccctag 900 
tttgaagcgt cctttctgta aagtggtact cctaccttgg cccggtgctc caggggtggt 960 
atgaacaaca aaatgtagaa gctgacagtg ctgtccccac ctactgggga cataccactt 1020 
ctattcatgc tgcttaaggc taccttcctt tatcaccaac ttcatcatac tctcagctgc 1080 
aacagagctg gaggtcactc acacctacac acccacctgc cgcaatcagg acgtctgcca 1140 
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agccagcttc ccgccatccc gtttctgtat ggttgctttt tgtaacctca gtttcagctt 12 00 
tactctcgtt cctatggggt cccatcattt tcttgggcca ttgttgtaac ctgtcccaat 1260 
ctttccatgt ccctattcag ccagccattc tattaaccaa ccctcctggc ctcgtgtcat 1320 
ctacaaaact gagcagtgcg cgtccctgtg cctgcttccg aatcctggcc aacaatatgg 1380 
cgcaggacaa aggccaggag caaggctctt gaaagtgcta ctggagagct cctgaggccc 1440 
ggtatccatc tcagggggca gcagaggggg gtgtttaatc agcaattcaa actgtgccta 1500 
ggccaagttc ttacgaggct gtgaaattgg gaatgatatt agtttcctag ggttgccata 1560 
agaaaatacc accaactggg tggcttaaaa caacagaaat atatttgcac gtatccctgt 1620 
gtaacaaacc tacacattct gcacttgtat cccggaactt aaagtaaaat aaagaaaaac 1680 
aaaataaaag aaagaaaaaa agaaaagaaa tgtatttgct ctgttattag aggctacaag 1740 
tccaaaatca aggtatcagc aggaaaatgc tccctccaaa ggctctcaga gagaatcctt 1800 
ccttgcccct tccagctcct ggtgactcct ggtgttcttt ggctcgtgca gcatcactcc 1860 
aatctctgtc acgtggcctt cctttctgtg cctgtctctc tccgtcttct tataacgaca 1920 
cggatcattg aattgagggc ccaccctact ccagggtgac cttatcttac ttaactaatt 1980 
acatctgcaa agatcttatg tcccaataag atcacaatgt gaattttggg gtggacatga 2040 
atttggggga aaacgctatt caacccacta caaatagcca ttctttctca caggagcttc 2100 
atgagggacc agaggataag gtagcacagg gcagggcttc tcaaactata agagccaaga 2160 
agcacctggg gatcgcatta aaatgcagat ttggagccag ccagcctggg gtgggtctgc 2220 
atttctcaca agctcccagg tgacatcctt gctgcttgtc ctgctttgaa taacgaggat 2280 
gtaagtgaaa gaaagtgctt tgaaaacagg tgtcaggtag ggggattaca tttcttttgt 2340 
aatgagtttt tctcacactc ttatttcccc acacctgaat cttgtgtgaa ctgtggaaga 2400 
acagaaaaag tgttttcatt tcagtcccct ccaaaagcag acctggagac aaggacttgg 24 60 
gttcaggtag tttattgggg aggcggaaat agggaaaggg agaaaaggga gagaagccaa 2520 
tgaagtgcca cattaaagag caggttgccc ttaaggcaac tggagctcca acctgcgggg 2580 
gaccactggg aaactgtggg gacactcctc agaactgtcc tgtcaaaggg ctggggcatt 2640 
tgcccaccaa ttcccctcct ccatcagttg aaagttacct gggagtgtca actcctccac 2700 
actctcccaa gtgctgcaga aaaccctccc tccagcagag aagttgcagg tgtccgaggt 2760 
gggaaatagt ggcatgctgc aaacagccct gtcccacagc taggtgacct cagctaggct 2820 
gaagagatgg gggagggagg ggggtcgtca tcatctgcta caaaaagctt ccaggatgtg 2880 
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cacgtcatct gtgtctctac tattctattt tgagtgcatg agtggctcag gcgagctctt 2940 
gcttacagtt gtgattatga ctgtcatcca gcaccagtgg acaccttgct cataggggtg 3000 
tcatggaaga atccgttgat ggcctgctga aacccacatg tgccccctct ccgctaggcc 3060 
tgccttcccg ttggtaaaat agtttatact atgggtgcat gtggaggaga acccacccca 3120 
gagtaatgta atcggtgctt cagggtcacc ttaggaagaa gatacttccc tgtctcacag 3180 
ttcctacctt cacctagttc agttgatctg ctcttgacat tcagtgttac ctttcatcca 3240 
cataacgatc ctatgaggta gtggctatga ttgtccccat tatatagcca aggacactga 3300 
aggccagaaa aaccaatgtc acttatcgaa ggtcactctg taagtggcag aagtgggatt 3360 
caatcacagt ctttctgatt cactagactc tgtggattcc ataggaaaaa ctaggtggta 3420 
ggtctcatga gatttaaatc acaaatctaa ctacacttct agggtgacct gctagatgag 3480 
aacagcaaga actgctttta tttgtatgca gcttcttgtc ctattagaac acagtgtgtc 3540 
cattagaagc tgattcttta cattaaaaaa gaatttacaa caagtaaaat gtgtgtatga 36 00 
gtcagtgtgt atttactttt tatgagtcag tgtgtattta ctctgtgttg gtatttactg 3660 
ggagggatgg attcattggg atgaatggat ttttcttaga taattggtaa atgaatagat 3720 
gccaactcta ataagcaaaa gatgccatgg agaaatgaaa taaattaatt caatgattca 3780 
atatcattag ttttcaaaac gtcaccagag gatttcctca actagggact tttcccagca 3840 
catcaaaggt aagattccac tcccacaagc atgtcaggaa cacacttatc cacaaaatac 3900 
gaaaaatcca tcttcttcta tatttggggc cattttctct cccttactaa gcagtgaact 3960 
tcccgaggga aggaagggtg taggttcttt agacttccct acacattgtg tgggcagggc 4020 
cacaaggtgt gtacgcaaag aaatttcaac ccctaaattc aggatgccag cacttgaatc 4080 
cacaaggctc ttggaggggg agggaaagag agtttttcta catttcttta attcttgatg 4140 
ctttggggga ggacacattt attgctcacc acggagaacc gagaaaagaa gagcccccag 42 00 
tctccgcctt gaaggagctg acagtccatt ggagaaacaa gtggaacacg ctggaaacaa 4260 
cccatgcatg atacaggaga gctggggaag aagcggctct gactgtaagt gctgaagggg 4320 
ccaaggggct tgggaaaggg agagatcagt gtggcctgag aggagaggag gtggaatgaa 4380 
aagagcactg ggctgggagt caggagacct ggcctctaga ccagttcctt gcctaattct 4440 
ctgtgtcatc tgatgtgttc acttctctct ctctctctct ctctctctct ctctctctcc 4500 
cccccctctc tttagacggg gtctctctct gttgcccagg gctggagtgc agtggcggaa 4 560 
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tcttggctca ctgcaatctc tgccgcctgg gctcaggtga tcctcccacc tcagcctcct 4620 
gagacactgg gactataggt gcgcaccgcc acacccagct aatttttgta tttttagtag 4680 
accctgggtt tcaccatgtt gcccaggctg gtattgaact cctaaactca aaccatccac 4740 
ccgcctcggc ctcccaaagt gctgggatca cgggtgtgag ccaccgcgcc tggccatgtt 4 8 00 
cacttctctc ttacagatca ccctcaacta tcttgcagtt tagtaagatg gtcagaactc 4860 
ctccagatgc atggcgatcc cccaggagca agaaggtcca tttattcgcc caacatcttc 4920 
tgagcaccat atgtgccagc caccatgcca ggcactgaga agcaaaagtc aaacagataa 4 980 
gctgaaatag ttgcaactcc ccacccagct actgtcactt ttgcatcttt gcttgtgttg 5040 
tttcattccc ctgcaccact ctcctcattt tgtccttcta ggaaaactat tgacccttca 5100 
aacccctggc caggaatcac cttgtttgtg aagacttcta tcttaggaca ttcaggttaa 5160 
acagaatacc atagactggg tggcttaaac aaatatttat ttctcacagt tctggaggct 5220 
ggaagttcaa ggtcagggcg ccatcatggc ctggttcttg gtgaaggcct gcttcctggt 5280 
tcacaggtgg ccatcttctt gttgtaccgt cacgtgtctc ttcttctagg gggctctaat 5340 
cccataatgg gggctccact ctcacaacct aatcacctcc cagaggcccc accttaatac 5400 
catcatcttg ggagtcagga tttcaacata tgaattttgg agagacacaa acattcagac 5460 
cacgacaact tccccgagct cccgttctac cctcatgccc agcagagtta cttcattcgt 5520 
cctctgtgct cctgtcagtg gacctgcatc agtgtcctta ctgtgctgga ctgagatggt 5580 
ctctggaaac ccctggagga cagggaccaa gacatatctt tgtacctagt gctggcaaaa 564 0 
ggcctggtcc agagcagtgg ctccttcagt aaatgtttgt caaatcacac tgaaccaaat 5700 
caaactgaat aggatcttca gctccatcag atttcaattt catttgcttt atggtcacct 5760 
ttgggaaaaa gggagcttag aagcatttgc tgactgtgac agagaggatg ccgagagagg 5820 
ggctcggagg tgtttgatca aatggaacat tacagaagag tctataaatg gagatcaggg 5880 
ctctgctacc acccctacct cacgctccag ttgacaaccc aaactccaga ggggcccatt 5940 
taataagcca ggaccaggca tcattggtgt ggcaactgca aagtttcact gggctgggga 6000 
tgcccatcat gctgcacata gctgtcctcc ctccccccag agttgtgcaa agcagcagac 6060 
ttgtaccaca gatgagcaga ggggtgaggc tgatcagccc atggtgggat ggcccttggc 6120 
ctccaccgtg tatctctctt gtacccagcc ccactatggg cagagggaga aggcagagaa 6180 
aaataataaa gagtaaatgg cagagtctac acacttacat ccctacttca tccttagtga 6240 
ccttgtaaat tggcacacag ggtccagggg agagcatgtg tcacagctca gttccagctc 63 00 
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agctatttcc acccctggag cacagaaagc tggggcaaca tcctggggca cagagggcaa 6360 
cacagtctcc atgttttggt cctacttttc tgtgctgcct tggaaatgaa tcatatgtga 6420 
ctcttcccaa ccagagaagt tgtgaacacc acaggtctgc catctgtatt catccagggg 6480 
cttgttgggg tcagtgcagc tgggataagg agaaggagca gaggagaaag ccccagtgga 654 0 
ggcctggcgg caaagaccgc cccttaccat gagacaccat aacactgtgg ccagagccgg 66 00 
aggccagtgc aagagtggga agaggggtcc agactagacc ctgctgcaca ctgttaatac 6660 
tccatccttc catgcagaaa ggcccagggc tgatgagtta aagcccgcaa tgaggatggc 6720 
aagcaggtga gtgacggagt ggacacctgg acaaacttct aagcaagtgt ctttgcaaga 6780 
accttagctt ttttctgggt acaatgttcc atgtccactg ccacctcaga gtttaggccc 6840 
cagaggaggc ccatgcatag gattcaggct caatctgaaa caggtctggg ctcagcctca 6900 
gtctgtggct ggcagcccag tccagctcac tgggcccgac gtggcctgcg tccatgtgct 6960 
tggcttcatt aacaccaagc tctgaccaac cccagtgacc aacccctggc ccagtctcta 7020 
ggaacagctt cagttaaaac tttgtaagat gccagtaaat agcacagtaa tgaaatccac 7080 
agatgagcta aaactggact gttcagaatc agccaaggat gagatggagt ctccaggggg 7140 
cccggcaggc tagaaatatg gtgaggaaat aaccatgtga aactcagcct gggaaggggc 72 00 
cagcaaacgt catgtgcatg ctgccaaatt ctccccacag ttagcaggaa gcgggagggc 7260 
tgcggggagc agctgagcta agggagccag agaggcggga tgtggtaggg gctcggagcg 7320 
ggtggcccca gtccagtcat ttgggagtga gtctgtagag acagtacagg gaccaaagag 7380 
caggcccctg cccacgtggg cccagagaaa ctccaaggca ggccagcagg accaaggagt 7440 
tccagggggt gcagagcttt aggcgggacg ggaagggtat tatggatgag tcatgcatgg 7500 
gacagctttg tgtcaaggta ccaagatctc tggtgctcac ctgtccgctg tggagccccc 7560 
agcaagccca gaaaatacta gaggcagcaa aaacatcagt gatcaagtca gacagaccca 7620 
ggtccgactc ctggccctgt tctgaacctt taaactagtt atctaactcc catcagcctc 7680 
agtttcttcc ttccttcctt ccttctttct ttctttcctt ccttcctttc tctatctctc 7740 
tctctctctc ttcttatttc tttctctctt tctttctttc ttctcactct gtcacccagg 7800 
ctggagtgca gtggtgccat ctcaactccc tgcaacctct gcctcccggg ttcaagcgat 7860 
tctcctgtct cagcctccca agtagttggg attacagatg cccactacca tgcccagcta 7920 
aattttgtat tttttttttt tagtagagat aggatttcgc catgttggcc aggctggtgt 7980 
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ggacctccta acctcaggcg atccacctcc ctcggcctcc caaagtgctg gaattacagg 8040 
catgagccac tgcgcccagc ctagcctcag tttcttcatc tgtaagacaa aatgttagac 8100 
tttcttcata tggtgtcctg aggacaaaat aagataatgc ttatgaagca ataagcataa 816 0 
ttgttagcac acagcatcag ccaagagaag gaggaggagg aaaccaatga cccctggtgc 8220 
tgctggagcc ttgtgtcctt ccttgtccct tcccctcaga gcctgacctc cttggggcta 828 0 
caaaccaacc tatcctctct tcagctggaa aataactgtc ccctcgaaac actactctct 8340 
gccaatgcct cattaacgag gcattgccac tttaatagcc aatcatggtc gactctttta 84 00 
ctccatccat aactcttgca tgagtggaga gagcctttac ctacaacaaa aatgttgaac 8460 
tttgagcttt tcctcaaaga tttataaaac aagtcctaat gtgttatttg taatagacta 8520 
catctcaaca tcgaggctgg ttaactttgg ttgttgtagc ccagtgaggg cccacactat 8580 
ggattccatc tgtgtggact actttccttc attctagccc tacccatggc ctcaaccata 8640 
attgaaccca aaccttggcc tcagcttgag cccccaagct ctaatctgcc tgcctcagat 87 00 
gaaatgtgaa aggctcaggc caaactcatc cccaccgtag ggaaaacaac ctaaagacat 8760 
ctcatggcaa tgacactggc ttattggttc cttctctctc aattatgagc actcggtgaa 8820 
actgaagttg catagaatag actcccccaa ggaatgggag acccttggaa cctgcccttt 8880 
aggaatttgt catctcaagt tggtaatgac aacagctagg aatccaaggg caagccacaa 894 0 
atgatacgat gttctctctt gttggatcaa tactaaagga cctgaggggc ctgctgccat 9000 
ggtgaggagg agccttcagg ggtacccagg ctggactgga tgcagggagt aggctgctgc 9060 
tctctcatag atccaagaag tgtttgagca ccaagtcttt ctcctccttc aatgcccttg 9120 
agttacacca acctccagca agagtttgac ccactggtaa gattcatttc ctgggaactt 9180 
ggcccagacc ccagcagaga gcctggcctt catctaggct gttgttttcc atgagcagtt 9240 
cttgtctgat actaggaagc gtcctgttgg attttccagg ccagctccac tttcccgcat 9300 
cttgtgcctt agtaagacct tgtgtccaga attctggttc agaacatgtg ttctgtataa 9360 
ataggaagcc tagaaaggaa atatgtagcc agctcttatc acctccatca tgtcaatgtc 9420 
ccctcaacac aaaggccttc tctggccagc tctagctggc tatgttcttg ctattaactt 9480 
tcttttaact gggtccttat gtctagctac ttcacagagt aaatcctagc tccccatctt 9540 
ctgcctaagg ggctgtgtct ccccatcagg ctgagagttc tcatgggcag ggcctatcct 9600 
ttccgccttt ttgtgttcct gtgaacaagg ctcttgcacc caatggacac ttttatttat 96 60 
tcattcatcc aacacgtatt tgctggccgc ctgctatgtg ccagtctctg agttaggaca 9720 
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ttgcagtgaa aaatcaaagt tccagccctc acgaacctct cattttagtg gaaggcaaag 9780 
gggaaggcct agaagggaaa gccaagagga ggcacttgtt caaccatcaa gccttgggca 9840 
cccaccgtgt cccaggcttt agactaggtc ctagaggcac aatgccacag cagagccact 9900 
tggccaggag gagccaccag ccaggccaga gttctggcca ccctcccaga ggaaatgttt 9960 
ccccccagga tcctggccga aggtggtgcc actggccctg gccagctgcc taccaccatc 10020 
cccacccctt cacccttttc tccaccacaa gcagcacagt tggctgtgct gtggagagtc 10080 
gttcatcagt acatccacaa ggagccacat tggtgtttcc ttgaaagcca aggagaagag 10140 
gtggttccca gctcagcctt ggagagaggc agcagcatcc tccctttaag ggttattgtt 10200 
aacatctcca ttgcctgcac tgaccaactg accagtgggg gagtcaaagc ccttggtccc 10260 
tccccagctg ctcccacaac cttcttgctc caagtggaca gtccagcaag gcaggtttct 10320 
gggagccagg gccctactgt ggaactggag gaagcccagg aagattgtct ggcctggggt 10380 
ttggctgaaa ggagagaaac aattgccttg agaacaaaat accagaactg aggactgatc 10440 
ccaagcaaga tggcaagata gtgatcccag gaggctgaag ggatcatgag gtgagcatgg 10S00 
aatcgcgctg ctcagatctc cctccaacag aaacttctgc aggtagcaga attaactgac 10560 
agccccagct gccacacttt tggatctacg gtggtgttca ctataggctg ctcccagcca 10620 
acgactaagt ccggcaggga taccagttcc agcccattcc tgcctgacat gggactcctc 10680 
caacaggcaa cccttgctcc gggatctccc atcaatctgg ccagactttc tcagagctgc 10740 
actgcagccc aagactcttc c.cactccagc ctcctccctt gcacctctcc ctcctcaagt 10800 
gtcagtccca cactgtggcc tgaaggtgct ccctccttgc tcctgcttcc atcttttatc 10860 
ctctgcagat ccccacccct tccccaccac ataaaggttt aatcccatct gatgtctact 10920 
tctcagagga cccaagctga cctaggaagc aaagatcccg tggatcagga tgagcaggct 10980 
aacggatatg tgtgccccct gcagccccaa aggaccacta cccactgcca gagaacattg 1104 0 
ttttcccagg tctgtgggag gagctgagtg tgggctcctt tccagtggcc tagaaggggt 11100 
cccagcctca catggctcat atcctgccga gaacctgcac caaccctccc ttcaactcag 11160 
gggtctccat tttcctcaag cagaaagcag tcttagaaaa ttctccccca acatttccaa 11220 
atatttattc attttaatta aaaaaaaaga aatccaaatg tacctcaatt tacccaagaa 11280 
ctgtgtcctt gagatgttaa gtgcaaatta aattgggggg aagtagatct tcttgtacat 11340 
gctccatggg cctcgcaatt aaagggagct ctgcagcaca tcaaacgtca gttcccatgc 11400 
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catgtatctg agttgacaat aaaactgatg actatgggta aatgacagta aaactggtta 11460 
actgtacaga agtcaagcaa acggcaacca cagcaggtcc ctggccctgg ggatctccca 11520 
gcccaccaga gagaccgaca ggatctactt ggagaacaaa ttcaatcaga ccttgatggt 11580 
gacagcattg ctcattcact agctctgaga agcaggaaag agtttagaca agaataacca 11640 
gggatgccac tgaaggtggg tgcccaggcc tgggctccag gcctccctac cactactaca 11700 
cagagctacc ctcttcctgc ctttccctgt ctgtcccagt tgctctaggg tcctcctagt 11760 
ctgtgccctg ggcctggcca catggttggc tgcctgggga ggccctggct ttgttcccag 11820 
taccagaggt tggggtgaag acatgcgggc tgactggagt gtggccagcc ctcaggccca 11880 
gctgcagggc gcagggccaa gttgacaatg ccagccactg gagacagagt ggcaactact 11940 
gctgtcgcct ccaaaatacc agttgaggtt ttctcctact gtggggttta gaaactattt 12000 
ccatttctat tatgcctttg ttttgcccta ttttctcttt tctttttaat agtgttggtg 12060 
gttgtgcctg gaggtgagac aggggagggc aggacaagag ccaagagaac gctgaagtca 12120 
gcagagtgtg ctctggtgcc tcgcagcttc tcttccagca ctgcccactc ccactgcagg 12180 
gcctgggtct caatggcagt ccctctcagg gcctattctc catggacaag aatcagcagg 12240 
gctcttactc caccctattt ttccccgatc atgacttcct gtaccgtggc ccactcttag 12300 
catccatctg tctgctggta aatgaggata ttcactctgt cctgggccac cttctcctct 12360 
cattgcacac tgtcttccta gacagtccca cccacaccca tggcctcagt taccacttac 12420 
atgccataaa ctccacattt atttctccag ccccttctcc cctctgattc ttagacctgc 12480 
atatccagct gctctctctc ccttagtggc tcacaggcac ctcaaactcc acatggccag 12540 
aactgacctc atcatcacca ctctacccag ccaccaaata aacacacccc tcccaaaaga 12600 
gaaaagtatc ttctcctgct tcagtcttct ccatctcagt gaatggcagt accatccttc 12660 
tagctgttca agacagaaaa ctgggggcga tcctgaactc tgccctttac ttcacccatc 12720 
agctatatcc catctatcag caggtcttgt caatcccatc ttctatagag ctctccatgg 12780 
ttttgatagc tccctagtcc cattgtcact cttttaatta agacctccat caactcattc 12840 
atagacaact taaattacct cttaatttgt ctttctgcct cagtatttac ctcttgtcca 12 900 
gttaagagac caccaggtgg ccaccagatt ttccttagaa acagatatga tcatgtctct 12 960 
cccctgttta aaagcctctg ctggaacttc tacttccaac cacattggag taaactgaaa 13 020 
cagacttttt cttctgccat ttaaaaaaac cctaaaaaac gtaccaaaat acataaaaca 13080 
gctatttata gacattggac aataggcagt gcaggactgt gatccctgag agaaggtgag 13140 
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gtgaagtcta tgatggcctt tggtttctgc ccaaaggcac tttctggacc ataatatggg 13200 
atggggaaac cccaaaagag catagtgatc tcctggagtt aatgagtaaa aatattagag 13260 
tttgggagac tgaggcagct ggaattttca ggtcagagtt ctagaaagga aagagcaaca 13320 
tagaaaaaaa gagttccaga tatctacata gggagcctct tgattgactt gctgaacact 13380 
aagctataca tacatagggt gacaccctac aaatccaagc aagaactacc aggaagcgat 13440 
aggctgaaat tcccagagtc ctgacaagta taggagacat acaatttcca gctagccaga 13500 
gtagagaaac cttatggaaa atccagggta ttcagtcaag gctgccagaa gagtcacacc 13560 
tcaccagtaa ggataaacta gacctagaat aaatgatact ctagaccttc cctaatcaac 13620 
cttaaaaaca agcagtaaaa ggcccaggcc gatatacaag aaacttaact gctttctaaa 13680 
acaaaactca agatttctta aaggaagaca caaaatctag acactcaaca atgaaacatc 13740 
cccaacgtct agaatccaat taaaaaatta ctagacatgt aaggaagcag agaaaatgtg 13800 
ctgcttccat agccaggaaa gaaaaaaaat cagtcaatag aaacagccag aaatggcaga 13860 
gatgatagaa ctaacaaaca aggactttta aacagctgtt ataaatatgc tcaaggattt 13 920 
acaagaaaac atggatagag tgaaggaaat ataaaatatg tattaaaata gccaaataga 13980 
atttttagtg gtaaaaaata caatatttaa aataaaaatt tactagatgg gcgtaacagc 14 040 
agattagaca ctgcagaaga aaagatcagt gaactagaag aaatagcaat agaaactacc 14100 
caaaacgaag cacagaggga aaataaaaag ttatttttct taaataactg atttaataga 14160 
ggctgataaa aagttaacag aatgaatggg taaacaaagt gcagtatatc caaacaatgg 14220 
aatattatct gtccataaaa agaaatgaag tactgataca tgctatcaca taagtgagcc 14280 
ttacactaag taagaagcca gccacaaaag atcacaattt atatgatttc atttattttc 14340 
atttgttgtc cagaataggc aaaaccatac agacaaagta gatccgtggt tccttagggc 14400 
tagaggagtc agagggatgg ggaattgcta aaggatatgg ggtttctttg gaggtagtga 14460 
aaatgttcta aaattgactg tggtgatggt tgcacatatt atgagtatta taaaagccat 14520 
tgaaatgtac tcattaaatg gatgaattgt acagtatgtg gattatatat tagtaaagct 14580 
gttatttaaa aaatactaat ggagtttcag tgacctgtgg ttccatatca cacagtctaa 14640 
catatgtgta ataagagttc taggaggtgc tggaggcaga aaaacatttg aataaataat 14700 
ggctgaaaat ttggcaaatc tgatgaaaac tgtaaactat aataatccaa gaaactcaac 14760 
aagcctcaag aagaataaaa atgaagaaaa tcaaaccaaa gcacagtatc agcaaattgc 14 820 
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tgaaaaccac tgataagaga aaatgttagt accagcttta gggggaaaaa aatgacacat 14 880 
tatatactgc ggaaccaaac taaagataac cacacgcttc taaaccaaaa cgttgcaagc 14 940 
cagaagacaa tgaagtgata tctttaaact attcaaagac ttaaactata agaaatgtta 15000 
aaggaagctc ttcgggttga gggaaatgat accagttgga aacacagact tacaaaaagg 15060 
aatgatgagt gccagaaatg acaaacatgt gggtaaataa gaaaatgctt gcttttctca 15120 
attttaaagt ttttatttaa aagacaatta attgttcaat gccaatgaat tgttcacttg 15180 
aaaatggcta attatggcca ggtgcagtgg ctcatgcctg taaccccagc actttgggag 15240 
gccgacagga ggatcactcg aggccaggag ttcgagacca gcctggccaa catagcaaaa 15300 
cctcttacta aaaaatacaa aaaattagcc agggctgtta gtgcatgcct gtcattccag 15360 
ctactcagga ggctgaggca ggagaatcac tggaacctgg gagatggagt ttgcagtgag 15420 
ccaagatcat gccactgcac tccagcctgg gtgacagagc cagactctgt ctaaaaaaat 15480 
aaaataaaat aaaatggcta attttatatt atgtgaattt cacctcaata cattatttca 15540 
aaaaatataa atgactgctt aaagcaaaaa caataacgat ataccttggg tttatatgta 15600 
gaagcaaaag gtattacgac aatagcacaa atgatgggag gtacaaaagt atgctgttgt 15660 
aaggttctta cttgtccatg aagtcatata atagcatctg aggtagattg tgatggttaa 15720 
aaatgcttgt tttaaaccct agatcaactg gaaaaaaaat ttttagctaa taagccaatg 15780 
acagagataa agtagaaaag taaaagattt ccgttattcc ttgatactta tctctcaagg 1584 0 
aggtagagct taattcccac ccccttgact gtgggctgaa tttagtgact tgcgtgtaat 15900 
gaatataatt tctaaaggga aaagcagtaa ctttacagtg gggaaacgtg gcagacacca 15960 
ccttgaccaa gtggtcaagg ttaacatcac cagtaagtca tgtcaatatc atatacccct 16 020 
gatttgatgg gatgacaagg gtacatcacc tcaatggtat tctttccaaa aatgcatatc 16 080 
cacagaccaa tcaaaagaaa atatcagaca aacccaaatt gagagacatt ctacaaaaca 16140 
catgaccagt gcttctcaaa actgtccaga tcgtcaaaaa acagaaaaac ctgagaaact 16200 
gtcatagcca agaggaacct aagtagacat gatcatgatt aattgaagtg ggatcctgga 16260 
atagaaaaag gtgtgaatgg gaaaactggg aaaatccgaa tgaagtctgt agtgtagtag 16320 
tattgtacca aggttaattt cttatgtttg agaaatatat tgtgtttatg tcaagtgttt 16380 
atgaaaggtt ggatgaaagg tctatgggaa ctctccattt cacaactctt ctataaatct 16440 
aaaattattt taaaacaaaa agtttttaag aaaataccag gccgggcgca gtggcttatg 16500 
cccgtaatcc caacactttg ggaggctgaa gtgggcagat cgcctgaggt caggagtttg 16560 
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agaccagcct ggccaacatg gtgaaatctc atctctacta aaaatacaaa cattagccga 16620 
acattgtggt gcacgcctat aatcccagtt actcagaggt tgaggtagga taatcacttg 16680 
atcccagaga cggcgattgc agtgagccaa gatcgtgcca ttgcactcca gcctgggcaa 16740 
caagagcaaa actgcatctc aacaaaaata aaagaaaata ctcaacccaa aagagggcaa 16800 
aaatagagga aaaagttgac aaagaataga aaggaagaat agaaaacaaa tagagaagtg 16860 
gatgacctaa attcaacaat atcaataacg acatcaaatg tgaagggact ctaagcagta 16 920 
caattaaaag ttagagattg tcctactgca ttaaaaagaa aattggactc aacatttgtt 16980 
tcaagcatat gttttaagca taaagacaca gatgggttaa aactaaaatg ataaagatgc 17040 
accaggagtg gctatgttaa tttcaaagta agctttaaga cacagaatat taccaggaat 17100 
agacatgttt aatgataaaa cagtcatcag gacaatataa caatcttaaa tgtgtatgtc 17160 
cctaataaca agcttcaaaa tacatgaact caaaactgac agaattgaaa taaatagaca 17220 
aatttgcaac tatagttgga gagttcaaca gttctttctc agtaactgat aaaataagta 17280 
aacagtaaat cagtaagagt atataagact taacatcttg acaggagtgg tggctcacac 17340 
ttgtaatctc agcactttgg gaggccgaag caggcagatc acttgaggtc agaagtttga 17400 
gaccagactg gccaatgtgg taaaaccccg tttccactat aataaaaata caaaaattag 17460 
ccaggcatag tggcaggctc ctaaaatccc agctaccaga gaggctgagg caggagaatc 17520 
acttgagcct gggaggtgga tgttgcagta agctgagata gcaccactga actccagcct 17580 
gggtgacaga gtgagactca gtctcaaaac aaaacaaaaa caaacaaaca aacaaaaaaa 17640 
cccacttaaa cagcaccctc aatcaacctg acctaattgg catgaataga acactgcacc 17700 
caatatctgc agaatacaca gtcttctcaa gcacatatgg aacattctcc aagatagacc 17760 
ataagctgga ccatcaaaca agtctcaata aatttaaaag gactaaaatc atgaatgtat 17820 
gttctctgat ctcaacagaa ttaaattcaa aatcaataac aaaaaagata tctatggaat 17880 
ccttaaatat ttggaaatta atgacacact tcaaaataat ccatgagtca gagaataaat 17940 
caaaagggaa attgaaaagc atttttaaat gaatgaaaat gaagacagca tataaaaatt 18 000 
tgcgggatgt cactaaagca gtatttaggg ggaagcttat ggcactcgat acctatatta 18060 
agaaagaaga aaggtcttag atcaatgacc tcagctttgt taaaagaaaa actctagaca 18120 
aattaaattt aacagagtta aattcagcaa agaccaattt gcaaatcagg cagcttccta 18180 
aaccggaata gatttagagt gactccagca ctgccatgtg gttggagaaa atttatgtac 18240 
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agaaaaagga aagtgatgta cagaaaacgg gagtgtgggc tgggcagagt ggcttacgcc 18300 
tgtaatccca gaaccttggg agatcaaggc tggtggatcg cttgagccca ggagctccag 18360 
accagcctga gcaacatggc gaaaccccat ctctaccaaa aatagagaaa tcagctgagt 18420 
gtggtgacac acgcccataa tcctagctac tcaggaagct gaggagggag gattgatcgc 18480 
ttgagaccca ggaggcagag gttgcagtga gccaaggttg tgccactgca ctccagcctg 18 540 
ggcgacagag tgagactctg tctcaaaaag aaaaaaaaaa aaagaaaaga aaaagaaaat 18600 
ggaagtgagg tacagaacca gctggattaa ttacagctca atgtttgcct tatttgaaca 18660 
caatttgaac agttggccgc ctgtgatttg ccaaaactcg gtgactcgta caagagcagg 18720 
ttacagtttg tttacacatc cagttaggtt acagttcact atgcacacag aaacctttag 18780 
gccgaactta aaacacgtaa ggaggcaatt tcatgctaaa cttaacagtt tcaagcttaa 18840 
aaagctagaa aaggaagaat aaattaagcc caaaataagt agaagaaagg aaataacgaa 18 900 
gataagaaca ggaatcaatg aactagcaaa tagatagaga agatcaatga aaccaaaagt 18 960 
cagtgctttg aaaatatcaa taaaaacaat aaacttctac ctaggcagta ttagttatat 19020 
gattgtatat atttatcaaa ctcattgagc tgtataccta taatgggtgc attttatttt 19080 
ctgttaattg tacccaaaaa catggacaaa agatttgaac tgagacttca ctaaggtaga 19140 
tgtaaaaatg gccaataaac acatgaaaag atactcaaca tcattagtca tcagtgaaat 19200 
agaaattaaa gccatcatga gatgctagta cacattcact tagaatgcct gaaaattttt 19260 
taaaaataac tatactaatc ggatataaat caatgaaaac tctacaaagc aatgaaaatt 19320 
acatttctgg ctggagtaca aaatggtaca tccattgtgg aaattagtct gatagtgtct 19380 
cataaaatta aacgtaagct aatgctttgg cccagtaatt ccacttgtag gtatttatcc 19440 
aagagaaata aaaacatatg cccacaacac acctcataaa agagtattca tagaagcttt 19500 
atttatgaaa accccaaact agaaatagct caggtgtcta gcaatgggtg aatgaataaa 19560 
caaattgtga tacatccata gattggatca atggtatacc aatacaatca tacatcgcat 19620 
accttataac ttttatggac atacaacata tacaatggca tattaataca atggatattc 19680 
agcaattaaa aggaataagc tactgataca tgttacaaca tggagaacct caaaagcatt 1974 0 
atgtaattga gaagccagac acaaaggcta agtaccatat aactctgttt atttgaaatt 19800 
ctagaataag taaaactaac ttatagtgac aggaagcaag atagaggtag tcaagggtca 19860 
cgagtgggag tgggattgca aaagtcttca aggacatttt tttgccatga tgaaaatgat 19920 
gtaaatctca atcatagcag tggcttcaca tacagggcca ctcctaaggt gctcagaacc 19980 
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taggaaaata tattttgtgc aatccctgtc tacacacaat atttgagnca tccaagacta 20040 
gtgtgtcaat accattctgg ttgacccaat ttcatgcact tctgtcaaag atatattgct 20100 
ctggctagtg gagcaatcca cttatcatgc tgctctcctg gaaccgggtc tcaaccaaaa 20160 
gtaccataga cgatcccata ggtgagaatc gtggagttcc ttgggacatc tggtgggccc 20220 
caagcacagg atgggggatg tctagagact cagagtgcct ggaaaaacac agaccaggtc 20280 
ctatgtctac gtgggaccca ttctgagcta aggaaagccc aacctgatga caccactgat 20340 
tccggggcag cagtctcaag ccgctgagcc tacgtgaata aagtcaaggc aactgtagca 204 00 
tggtgtgctg agacaaaggc tctggtttca gacattggca ggaaaccgaa aattttaagg 20460 
aaagtgctgc aaagtgcagt gggttcctcc tgaaatgcag ggcatgggac agaagcccca 20520 
tctgcccagg tctaaggtag tgctggttta catgggtgca gacatttatc aaatgaactg 20580 
tacactgtat cattgaactg tacacttaca atgagtgcat gttattgcat ataaattata 20640 
cctcaataga gaatacttac taccaagaaa gaaaaccctg tagtctcagc tactcaggag 20700 
gctgaggcag gagaatcact tgagcccagg agttcaaggc catagtgagc tgtgatcaca 20760 
cctgtgaaca gccacttcac tccagcctgg gcaacatagt gagaccccat cttggaagaa 20820 
agaaagaaag gaaggaagga agagagagtg ggggggaggg agggagggaa atgaaaggaa 20880 
aggaaagaga aagaaagaaa gaaagaaaga aagaaagaaa gaaagaaaga aagaaagaga 20940 
gaaagaaaga aagaaagaaa gaaagagaaa gaagaaaaag aaagagaaag aaagaaaaga 21000 
gagaaaaaga aggaaagaaa gagaaagaag aaagagagaa agagagaaac agagaaagaa 21060 
agagaaagaa gaaagagaaa gagagaaaca gagaaagaaa gaaagaaaag aaaaagaaag 2112 0 
aaagaaagga aggaaggaag gaaggaagga aggaaggaaa aagaagaccg tgtatctcag 21180 
attcccaggc cccattccag acaaaatcag aaagtcttct gtctggacac tgagaaattt 21240 
tcatctgttg gttctcagtt gcctagagaa ttacatccag ggtgcacttt ccagcccagc 21300 
cctgacgtgc ctcctcatcc tcagttcctg atactccttt tcttaacggg tgctccagac 21360 
acgctgaact tagaaccagt ccctaaaagt gctcacaccc ctttgcaaac cagtgcctct 21420 
acatattagg tttcccctgc ttgaactcac accttctccc tgcaaggaat tcctacacat 21480 
cgttcaaaat ccatctgaaa tgtcactttt tctgtatagc catctccgac cacctaacag 21540 
cgtcaagatg gatggaggtg ttatcggtgg agggtgtcca tgttctccgt gttttgaaca 21600 
aagaattgga caaaacacac aaacaaagca ggaaagaatg aaacaacaaa agcagagatt 21660 
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tattgaaaac gacagtacac tccaaagcgt gggaacgggc ccgagcagcc actcaagggc 21720 
ccagatacag aaacttctcg ggtccaaata cccactagag gtttcccatt ggccacttgg 21780 
tgttcacccc atgtaaatga agtggtggtc tgcaaccagt cttattgcaa ccaatcagag 21840 
gctgaagtga agttacaaag gtcacacttt tatgcaaaga tctgattggt tgctgtctgc 21900 
aaccaatcag aggctcaagt gaagttacaa tgttgcactt ctatgcaaac gaagacttga 21960 
cccgcaatca gtctgattgg ttgtggacag cctacagagg ctgaagtgaa gtttcaaagt 22020 
tacactccta tgcaaatgtc taacaaccaa tcataggtac tttcaatttc ccatcggcca 22080 
cgcagaaaag ctgggagctt gcaaagggag tagcctctgg tccttttgtt acttaggcat 2214 0 
ggaaagttag ggttctcctt tcaatttagt tctaggaact cagcgtgaaa cggccttagg 22200 
ttccctgcct ccagaatgta ttctcctgcc tcagagagag gatgatccct cttcagtgac 22260 
cccacaaagc ttgggacata ctttttatgg ctatgacact tgtcaaaagt ggactgttgg 22320 
tacctgtctg ggggctttcc ctcatatcta attgcaaatt cctccagggc agggagcata 22380 
ccttcttcaa tccagcacct agcacagtgc ctggcaacat aataggctct tgatacaagc 22440 
tgttcaatga acgaatgaat cagtgaatga atctcttccc ctttggggac tctttagtgg 22 500 
gagccttcat tgaagagtgg tttcaccaca gagtgaccta taagcaaaaa attttaagag 22560 
ctgggacagc ctgggttggg gggtgttatt tggtcaggga aatattttaa ccccgaagat 22620 
tatgtccctg aggtatttca gcctatccag gacttgtatt tatgagatgg ccagggtcag 22680 
ggctcattct gtggccaaaa ttaggtgtta aatacccatt atcatcacca tcatcatgat 22740 
tatcctttat ttggcattta tcatgtgtca catgctggta taaacactct tgtatattag 22800 
ctcatttaat ctgcacagta actctgggag gaaggcgcta ctgggttgaa ccataagaaa 22860 
ttgtcaatat tcaatctttt ttgacccaca aaatagcatt ttcatatagt tcaacctaag 22920 
ttaccgaatt cttttaccaa ggttcaagga ggtgagtccc ttgcccagat ttcaatgttc 22980 
atctctctgg ctccagagcc cttttgctat tttctctaca ctgccagctt ctatcagttg 23040 
taaaatccct gctataagcc agtccacagg gtctagtaag tcggacagga gcgggactgg 23100 
ctagggtttc aaatgctctt ttgcagggaa gatgaactga cttggcctga aaatgttagt 23160 
cttttagcgg tcgtgtttcc aattttgttt tagtctctca ctcatccaga atccacctcc 23220 
ctcaccacac acacacacac acacacacac acacacatgc acacacacag aagagccaag 23280 
cagctgactc acacaccatg ccgcagttgc cttgactttg ttcacgtagg ctcagcagct 23340 
tgagactgca gccccagaat cggcagtgtc ctcagttggc acccagagga gctgttgcat 23400 
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tctccccttt attgtggaga tggctgcagc aggtgctctt gtcctggggg aagacagcca 23460 
ttgagattgg cagaggctta agtatgaagt caaaggcaac acctgatatc agaaagggcc 23520 
caggcagcag tggggaagtc ctcagctgga tcttcgagcc ttggacggaa tcagcagcag 23580 
gcagggcaga ccccaacccc actgatatct ctgctgcttg ctcgccaacc ctgtgagctg 23640 
cccaccgagg caataggcct ggcagtgctg gaggctggga agggagagac atatagtaca 23700 
gtaagcatcg accccccagg aaaggctcac agtgacccac cccccttcat agtatgaggg 23760 
caatgacaga gcattgtccg gattgggggt gggcaggaaa tagaggccag ggaaatgcac 23820 
atacccagag cctcaggcat caggtgggcc tgagaggcct ggctgaggaa tggggggtaa 23880 
gtctggacat gcagacagac ttcccatgat attctacctt gagatgaatg tggtcaggag 23940 
gagagtccag gctggctggg aggttgatct ggggctacaa gccctgggcc ccaggagaga 24000 
ggaactggaa aggcttcctc caagaacccc agaagcctct ggcctttccc ggcataacta 24060 
gaaagagaag cagacagccc caggatctta ctcaatgaga gcttaaaggc ttgccttcct 24120 
gtttcctctc tcatgaaggg gcagcctgag atgccacgaa gagccctggg ctggaagtta 24180 
ggaggccgga gtgctagtcc cattcccgcc tgccagagcc atcctctccc tgggctacag 24240 
tgttctccat ctgcagaaat gatgaggcag gatgcgtcaa gtggggaagg cagttggttc 24300 
taccagaatc atggttttca aacttttgca gtttggaaca ctttctttaa acaaaatcta 24360 
acagggcaac tcaacgtgta aggtggcaga tgaaaataga gccgctctca acccattccc 24420 
ctccccagag gttcccctag aacctccatg gtagcccaga gcccctttta aagaccactg 24480 
ggttagagct gcgggcttcc aacttgagtg cgcaccagca tcacctggag ggtttgtaag 24540 
aacagattgc tgggcccacc cctaaagttt ctgactgtgt atatgggatg gaactgagaa 24600 
tttgcatttc taacaagcac ccaggtgatg ctgatgttgc tggtccaggg accacaattt 24660 
gagaactact aggttacatt ctacccaagg gccttcctgt gattcatgtc cagtcattca 24720 
atacttacat taatccactt cagatagatt ttctgtgata ttaaagcata ggagaacacc 24780 
tggtattgtc cctggcacat agcaggcact cggtcaaagc tggtagggtc cgttctaaat 24840 
gctctgccta gaagtctgtt taggtgtaca atcatgttag ccactggtca agggattcac 24900 
agaagcagaa gctttacttt taataagttt acagtctgcc atggcccctc cacccaccag 24 960 
gaaataacac ccccactaat tccatcatta gaagttactc catgttaaac attatccttt 25020 
ccagatattc tctttttttc caggtaaacc ccctgggaag gggtagttgt tcagtccctg 25080 
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acccctcagc attccttaga tgttttcccc atgtgaaggg gacatagctt cgatcctcta 25140 
ggagttcttc cagggcaagg accttacctg tatcattctc ccccacatac cccacggctc 25200 
ctagcccagt gtctggtaca gagtggacac tcagtacaag tttgcagaat agaataaagg 25260 
aagcatggga cccccaactt gacccagccc acgtgagcca ccagcacccc catcatggcc 25320 
aaggctgcta actggggaac tttggagggt acatggggag ggcaggtctc cagccacacg 25380 
cacctgggat tgctgcccgg gaggttttgc aaactggctc caggaatgtt cggggcctcc 25440 
cattccccac atcctccttt tcagcctcac cacgttctta aacaaattgg cactgaccct 25500 
ggtgacagac ttgatcattt ccctttgctt cgtgattgtg ttaagtgact gccaaaggcc 25560 
tgttcctgaa tagtcagcac attcctcagg acggtgagtg ggagcccatc tgctcctgga 25620 
gcgcttcctt tcctttattg tgctattttg gcctcaattt cccatttgga caggagcatc 25680 
ttccagccgg tggggaaagg gagagggtgg agaggggcgc cacagccttc cttcatttct 25740 
cactcatatc cctggtttct ctggaaataa atcaacaaat attatactac gtgcctggcg 25800 
ctgctcttag ctctccacat gtgtccatgg agctaatcct tacaacaccc actatgagga 25860 
aggcacaaat attaccccca ttttccagat gagggaacag agactgagag aggccaagtg 25920 
acttgcccaa ggtcgcacaa tatgtgatag agctgggatt tgaacctaga caggtctggc 25980 
tctggagact gagttcrcac ccattggctc taccaagggt gtggattctt gaggctgtgg 26040 
atctgggccc aatggtatct gaggccatgg ccctttccct cttcactgtg tgtcagcagg 26100 
ttctaaagtc acttagattg aacacagaat tccagcccgg cgcggtggct cacatctata 26160 
atcccagcac tttgggaagt tgaagtzggga ggatcacttg agaccaggag ttcaagacca 26220 
gcctgagcaa catagcaaga ccccattatt acaaaaaaaa ttttaaaccg gccaggtgtg 26280 
gtggcgtgtg cctgtagtct catctactcg gcaggctgag gtgagaggat ggcttgagcc 26340 
caacactcca ggctggacaa cagagcaaga ccctgtctct taaaaatttt tgaaagaaaa 26400 
cagaattcct tgcagatcag aaaaggttat cttcatacct tgccactgac ccactcgttt 26460 
aatgcaacat ttgaactcac agatgctaca ccaatgtgga ctctaccttt gggttataaa 26520 
gggctgatat agtaatgcta aacagctcat tgtgtgctct tctccatgta ccttgcctca 26580 
tttatgtctc acatcaactc taaagtaggt tgtagggaga agctgaggct caaagatccc 2664 0 
ccagccaacc acagacccag catgtgggca attcattact acactagatg ttaaacagcc 26700 
aggtagcttt tgctaggtgt caggtgttgt ctaaacactc tccctcattg attcattcaa 26760 
gcctcataac aatcccatga agtaggtact attattatcc ctgtctctat ttcgcagatg 26820 
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aggaagccga ggcacagaga ggtcaagtaa cttgcctgag gtcacacagc cagaaagtgg 26880 
gagatctggg atttgaacga agtctgtcag cttagactgt ggggttccta gtgtatgctc 26 940 
ttaaccactg cactgtgcta cctcttggca ctgaggccac ctctgggcac ttgaggtacc 27000 
atgataccat aaccccctcc atctcttctc cttgtcttcc tgctattcca aattgacttg 27060 
gtatttccaa taaccataat tctcagaggt tctggagttc aaattacagc actgcactca 27120 
ctagcagggt gacctctgtt tctttctgta ccccagtttc ctaatctgta aagtgggagt 27180 
aatatggtaa tgagcccata gggtggttgt gagcatgtag gtcaggcaca gtggattccg 2724 0 
ccatcaccac caccacctcc ggttgcatct tctccagccc agccccagca tcctgaaggg 27300 
ggaaatggtg cttcacgccc tttcacttat cagtctcacc ctccgtgtct gtctcttcct 27360 
gtctcaaatg aaaagcagcc ttcagttaag attccctagc cttttgaaaa cttggtgatt 27420 
acaattcctc ccaggaggtc aaagaaacct atttccctag cattctaagc aggaattcta 27480 
ttgtcatttg ttaactcaaa catggccaag gtggcccagc ccatctcctt tcgtccacac 27540 
ccataggaga catagagaat agataacttg actggaacag atcccaccag caccccatgg 27600 
ccagagtccc tgagaaagaa attcacagca aggaaatgca ccaattgcaa ataattattt 27660 
ggaatctgat gattccttgc aacgtcagtg actgcgagtg agcctctccc tcttgccgtc 27720 
tttacagaaa atgttgaggt actagacagg gaatataaat tttccagtat ctttccctat 27780 
gttcctaagt gtccctgtcc tataattcca gtcttgcctc gttgctactg gtgtcttatg 27840 
tcctcagctc agtggggtag agcatagcgc tacctataag caggcagtta ggattagaaa 27 900 
ataaagtccc cttcacttag ctcttcatcc acaaagcaag ctctaaccct ggccactgac 27960 
tgagcactaa ccccagccta gaatgttccc taaacctgtt cagagaatga gcctaaacct 28 020 
tggtaatggg ctgcccttag tctcattcac agactgagta tataattgca gagtgaagaa 28 080 
actccaacct ggagaggtaa agcaacttgg ccagcatcac acagaaccaa ttctctgtcc 28140 
cagccatata ttgaccctga attttagcca ctaagaacaa gattttaagc gtgtgggtgt 28200 
atcagtgcaa gtgacctcta ctagtaagat tccaactcag acaggtgagt gggcaccaca 28260 
tctttgtctc tgagggagat gccactgttc cagtcttcac ggagtactgg ggttattaaa 28320 
aagaattctt gcctagggag agtgatttgt atgatgaaag aaatcaataa attgctgttg 28380 
aatgcagtcc aacctcactt attcaaacta attggaggta gcctttcaaa ttagtgaacc 28440 
atctgtattg taggttactt gtaaagtaat gaagctttat tgttttgaat aattcacagt 28500 
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agctccaagt aagctaacaa gttgtagcct aatcgaggtc ttaggagact tgccttaatg 28 560 
aaatgaaaag aatcctcagc atcaatcatt tgtatgcaaa tggtcctacc aaggaaatga 28620 
taaagtttgc tgccttgaag tatagaaact cagtgcacag agtccattta ttcaattcat 28 680 
ttatgacttt gttgagagtg gacaaagtca agacagacag ggcccaagct atccccattt 28740 
ggtactcatg gagtgtcttc taatgaggtt ttactgtaaa aggaaacaat tggagagcag 28800 
caggtgcaat ccatccactg ataaaataaa agatggtttt ggaatactgt cacttccttt 28860 
catttttttt tcctaagaag acattagaaa aaaaagttac caatgtaagt caagtacctc 28920 
ggtgctttgt caaaacccgg aacaaacatt aaggggaaaa aaatctgaag gtcgaggaag 28980 
ggaaataaac aagaagagta agtaaagcat gaatcatcca ttaccacctg atggctcctt 29040 
aaaaataaaa ataaaactat ggaaggaaat agagtccatg ccacagtcag gcatgttgat 29100 
agggaacctg gcatggactc ttatttcgca gaaagaactg acctttggct tctaatttgg 29160 
attgcctttt cctcaaggcc ttgcagagca caaggttaga cgatgctcgc ctagcaccgg 29220 
ttcccaattc tttcttgggc actataagtg gagccaagga aacaatttta ttttcatgca 29280 
tttaatttaa ttaccccacc ttgtttccca aagactttga ggtggttaat gacaatgaaa 29340 
aaataatgac aaaataaaag cagagaatta ggaccaggga aaatattaat agaaagtttt 29400 
aaatcaagag rtaaagacct gaaaggtcat ataaagttgc actacaggtt tctaaaaaga 29460 
ggcacagttg ggattcaaac cagctcagtc caatggacca ccttcggaag gagcacagga 29520 
tccccagatg gacctatcta taagtgtgat cttgggctag atacacaact tccctgagcc 29580 
aatttcctca tctgaatgat ggacctaatc aatattgtct atttcacaag ctgttatgag 29640 
gactcaacga aatggtttct ttaagccatt gagctccact ggctagcaga gaacatgtgc 29700 
ttaataaatg gtagctatga gaatggatat gaagtctccc atgatgcaat acaaagaatt 29760 
tgcccataat ggcatgtcat ttaatcttcg caccagctca ggccttgggc ccaggatcac 29820 
agagaaggga gagcagagga aggagggagt ccttggggag gcttcctaca gaaggtggta 29880 
ccccagtcag agcttctgta ggataggtag gatatgagga gatggaacta aaggtgaaga 29940 
atgagctgga agtgtatgag gacagtaaga aaacctacta aagcagatgg ccaggtagga 30000 
aatgaggtta tagagaagag gatagggcag gatgggctca tgaacttgct tgaatcgggc 30060 
cagatggtga aggaatgtgg atgtcatttg acaggaaatg gggagtcatt gaaggatttt 3 0120 
gagcaggttg agcaggggaa ggatgtgaca tgtttgaagg gatgttttca aaggtggagt 30180 
ctgtctgctzg ctctctgctg tgtgcaggct gtggtggggg ttgggagaaa cagggactgg 3024 0 
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cagaccagct gcagtgatcc aaggaaggga catgataaga gctcaagcta aggcaacggt 30300 
gacggagaag atgggcggtt cccaatttca gttagaggca cctgagaggg atgatttgac 30360 
tttctttcta gactcaaaat tccatgagga cagggaccat gctctcctgg ttcaccactg 30420 
tacccctagg acctagcaaa gtgcttggca catagcaggc gcccagcaag tgttttatga 30480 
atgaatgaat aaatgaatga atgcctctat gcagacagac actgaataag aatggttgct 30540 
tgctcaataa caagcaacag tacaaggaaa acaaaataga gaattttttt aaagaaaaca 30600 
ttactttaac tgactctctt tcaccatccc tggactacag taacaatgtg gtataggaat 30660 
gcaactcagg agaaatctag agctgactgg gaaagaagcc ctttaaagca gaacctgtat 30720 
cagcagtgac gctctatagg aagcatgcca accatatgtc ccctggacag ttcgcacagt 30780 
gtagaaacat cacaatgcag gtaaattata gataggagaa gaaaaatggc agtatttacc 30840 
tcacaaaagg attaactgag cttccattta aagtagcaaa ctgcctttct agtgtactta 30900 
atttgagacc aggacttgac gtactgaaac tgggttccat ccaagacaat atagagtaat 30960 
tgtctatact ggtaagagcc ttgatttggg agtacagaaa cctaaccttt gacctaggca 31020 
agcagcagaa cgtggtggtt acattcacag gccgtggaat tgtgctgcct ggatttgcat 31080 
cctggtgcaa cactgactgg ctgtttggcc ttgggcaagt ggcttagcct ttctgtgctt 31140 
cagtttcttc ctctgtaaaa tgtgatgata caacacctga tcctggtcca tgaggttatt 31200 
gtgaggatta agcgaggtaa tgcatggaat gtgttcagca atacccagaa cgtttcagta 31260 
aatatcggct ctcactctta atcctaacta tatgacctta ggcaagccac ttcccctctc 31320 
tgggccttgg tttcctcatg agcctgaagc ttcttttatt cataccttag ggatgtggcc 31380 
cccacccgta ctacaaactc atcatctatc acacataaaa gcatctccac cctctggggt 31440 
ccccagttga taactctgcc catagaagga ccattgggat tcctagaaat gagcaaaggc 31500 
ccaaatagac aagactagga cagattcact tttcttagta tactttgtaa aagttccaac 31560 
tcacccactc agctcccact tctaaatctt agcctcctct tttcctccca agcagcctta 31620 
cttccttaat catcttcatc ccaggcccta agggaaagga tgctcaagag cactgctatc 31680 
ctctggacag ctcctctagc caaaccccca tttggatccc aagtttttcc tttaggtttc 31740 
tcctccccaa ccccaactcc aacccaggga actgcaatcg caccaacctc tgctgccctc 31800 
tgctgtctag ttccagatgt gctggctcca gttgggccag ttatgtcagc ttccaaggac 31860 
ctgtcagttt ttccaggccc tagctgccac tgtgacctca ggaacacaac ccagatgccc 31920 
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tctgcaatat ctgtgctctc ccttaatctg gacaaatgca cagtctttcc tattcagact 31980 
cgacagccaa gccacatctg gtgtagtcca tgagttcaga actgttattt tcagcaaaga 32040 
aaggaagctg ctagagccat ccatactctc atctaagccc ccttctcaaa cattctgcaa 32100 
tcatttatta cgcatttgct gtgtgccagg cactgtgcaa aatgctgagg atgcattctt 32160 
attgaatcct ggcatcaacc ttcccagtag gtaaaattat tattattact tccattttaa 32220 
agatgaagaa tctgagactc agaaaagtga atgtcttacg caaggtcaca gagccaatga 32280 
atggcagaga tggggcacaa aaacagatct acctagcccc aaagctggtg ttcttaacca 32340 
ctggccatag tgtggagggt agcttggagg ggaagagagt gaaagcagac taatcattna 32400 
ggatctatgg caatagtcca gggcagaaag cagaggaagt gatgatcttt agatacatgt 32460 
aggaggtaga cttgacaaga catggtgtca aggcaagggg gagggagagg tctgcaatga 32520 
ccctcagaat tctggcttgg gcaactgggt ggacagggac atgttggggc atgggggaaa 32580 
tgaagagatc cattttgcac atatggagaa agagacatcc aggtagggaa ggtcagtagg 32640 
caattttttt ttttttaatc ttgctgtgtc gccagtctgg agtgcagtgg tgcgatctcg 32700 
gctcactgca atctccgcct cccaggttca agcaattatc ctgcctcagc ctcccgagta 32760 
gctgggacta caggcacata tcaccatgcc cagctaattt tttgcgtttt tgtagagacg 32820 
gagtttcacc atgttggcca ggatggtctt gatctcctga cctcgtgatc cgcctgcctt 32880 
ggcctcccaa agtgctagga ttacaggcat gagccactac gcccagccag caattggttt 32940 
tatgagtcta gagctcagga aagagacctg ggccagcacc gacttggaga tgggcgagag 33000 
tggagtgtaa ttcatcagaa gaaaagaggg ctgaggccag gacatcaata tttaagggga 33 060 
agggagaata aaagaagtgt aggaaaaaga ctaagaaaaa gtagccacag atgtaggaga 33120 
accaggaaca attgtcaagg aagtcaaaga aaaggagagg ttcaatgagg gaacagttct 33180 
gctaagttga ctaccacaga gaagtcatta gccttaaaat ggccctttag atttgggaag 33240 
tagaaggctg ttctcgttat ctattgccat gtaaaaacaa aacaacaaac aaacaaacaa 33300 
aaaaccttag tggcttaaag caacacaaat catggtatxa tcctatcatc tcttatggtt 33360 
tctgtgggtt agaaatttgg ggtgggcaag gaatctggct cagggtctca cctattgttg 33420 
cagccagaga ggggctggaa gctagaacat tgggtggcgg ggagctgggg gctggcctgg 33480 
ggctggctgg ccttctctct ctctcttcct gtggtctcag ggcctttcca tgccacttct 33540 
ctgcaggggc tactctgagc ttcctcacag cctggaggtc tccggggagt agacctgctt 33600 
gtatggcaac tgaagaccaa gagaaagagt cccatgtgct acagacagaa tgctcgtgtg 33660 
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cctccgaatt catatgttga aaccctaatc cccaatgtga tggtattcga agatggggaa 33720 
gtttgggagg tgattagagg tcatgagggt cgagcccacc tggtgggatt tgtgcactta 33780 
taagaagaga ccagagagcc ccatctctgt ctaccccctc cctccctctc tctccacctg 33840 
tcatatcagg acatagcaag aagatggcca tctgcaaacc aggaagagtg ttctcaccag 33900 
acacctgatc tgctagcacc ttgatcttgg actcctcagc ctccagaact gtgaaacata 33960 
aatgttcatt gttcaagcca ccccatctat ggcaatgtgt aacagcagcc caaactaact 34 020 
aatacaccat gtttctgtac atcagcaatg aaaaatccaa aaggaaatga agaaaataat 34080 
tccatttaca gtagtacata aaagaataaa atatctatca ataaatttaa cccaggaagt 34140 
gaaagacttg tatacagaaa actacaaaac attgcatata gaaattaaag aagacctaaa 34200 
taaatggaaa gctcactttc accacctgcc attggtcaaa atactcataa aaccctgccc 34260 
agcttcaagg ggagtggaca aagacatccc cataacctgc catctctcag tagcgaacac 34 320 
cacaaaggtc attaacgcct ctagcaagag ctcctttagt ggagtggtcg gggctcagat 34380 
cagaccacta tgggcgagga taggcataga caactttaca gaagggcata caatacctga 34440 
atcaatgaaa agacaaggta tattccagga tgtttttaaa aactcagaca ttttccccag 34500 
attaatccaa acatctaatg caatatcaat ttgaatcata acaggaattt ttataggact 34560 
tgatgtgctg atcctgagat catctagaga aggaaatatg caatatgaca gagaagtgtt 34620 
taaaaaatgg gcaggaagtc ttccactgcc agctatcaaa gcacaacaca agctatagta 34680 
atcaaagcag tgtaatgttg gcaccgcgac atagatcaat ggaacaaaag agtctagtac 34740 
aaccccacgt atatgtgaga ctataatatg cactacataa ggcactgaaa atcaatgaag 34 800 
aaaagatata ctcgtcaata aatggttacg ggacaattag tgtcaaattg ggggtggggg 34860 
gttaaggcac aatattctta aatctgtacc tcatgtcata tccaaataca attcccaaat 34 920 
ggattaatga gctaaataca cgtgcactca caaaaaaatt atatgaaact gtgagaatat 34980 
ataggacagt atteccttgt taagtaagtc acaagtccca gaaattataa gagtatcttg 35040 
ataggttttt ggctacacaa aaaaaatttc agtctctaaa attaaaaaca aagttaaaag 35X00 
ggagaaaata cttgaggtat ataaatattt gtcaaacaga taaatgaaga atgcacataa 35160 
tatataaaaa gatctcattt ttaagttaat aagaaaaaaa caaacaaccc aatgagaaca 35220 
tgggccaatg aaaggaatgg gtactcaaag atgaagaaaa aaataaaagg caaataaaac 35280 
ataaggaagc atagtcaacc tttaccagcc agcaggtaaa cacaaagcag caagagacca 35340 
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attttcaccc ataaaattgg taaaacctta aaccattgat gttatccagt gttgaagaca 35400 
ttgtggaaac agctatttta ttggtagaag tgtgatttgc tacagctttt cttggagggc 35460 
agtttggcag aatctattga catgacaaaa ttgtacgctt ttctacttct agatatggat 35520 
cctagagcaa tgcttcatgt gcgtgaattt tcatggcccc tgtttgtaac aatgcaaagt 35580 
cagaaacaac ctgaaggtcc atcaatcatg aaatgcaagc agggtaggga tttgaggaca 35640 
aggataaagg tctggaatct cccacaaggg caacagaaaa tacagctgac tcaggacagg 35700 
gaaaaggatt gccaagcagc accaagaggc ctgccgaggt tggacaccaa aaatttggag 35760 
tgatcccaac caacagggtt gggatatatt ctcctgtaga gaagacagat ggttcaaatg 35820 
ccctggggtt tacttattta ttcaataaaa agttatgaat gcctactaag ctccaggtag 35880 
tgttctaggc cctgagatat aacagtggac aaacataaac aaataggtac acaatataat 35940 
gtcaggaaaa tactctgaag aggtaataat acagaatagg aaataaagag tgacagaggc 36000 
actgtttaaa taatgaggtc agggaagtcc tctctgaaaa ggggacatct gaacagagga 36060 
ctgaatgaag cgaaggaatg agccatgttg gcatctgaaa aaaagagcat tcctgttagt 36120 
gggaatggta tatgcaatga ccctgggatg ggagcatgct ttacttgggg atggacattg 36180 
gcaggataca tggagtggaa gcacaaagga acaggagagt taaatgtgct agcaaaagcg 36240 
aggttggagt gatgggttat gtagtttgag ccaaatagag agggaaggga aaccaagggt 36300 
tgctgataag tggagaggaa gagcgcaggg caaggaacag gatgttttgg tgaagtcaaa 36360 
aatcaggtgc agtaagagtg aaggagagaa gcacaggagg gtgtgttcgg agaatggggg 36420 
gttggagtztc agtatgtcaa aggacagtga tcccaagcag tggcaaggcc caagatgcta 36480 
cttcggttgt gggaggctga ggaggagtgg aggagaaagg gccttgagct gaggagcttt 36540 
ggacttgaag aaggaggaca ttagatgggt cattgacagg aaccttgata tgactcatgg 3 6600 
taatggcaga catgctgcag agaggcagcc atgagctagt tgctgtggtc ctttccacga 36660 
ctgaggaagt gttgaaaggc agatgacagg acctgtgaca aaggcagtaa cagaactaat 36720 
aggggaagtc aacctcgcca ggaggtcaag gaaagcttcc agaccaaatg acatttgagg 36780 
caagactgga aggaagagga gttcacagag taaagagagg aaggaagggt gggtattctg 36840 
ggtgaagaga aatgaacaaa gaccctgttg ctagaggaga cactgtaaga gggactcaaa 36900 
gaatggcata tggctagagt ggagatggca aggagagggg cataagatgg ctgagaagta 36960 
aagagatcag gctcctgaag aggacagaag gttgggagag tgcccaaaga ccaccaaatg 37020 
aagggttgag tgaaatcaac tagaaccttg gtgagcccca gtggtgctat cgccatgcat 37080 
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ccttatgcag tgcaaggccc ctgagagggc agcataggtt tgtgacatgc ctggctgggc 37140 
tttggactca gccagggctc aggatctgac ttagtctgtg accactgaag cgaaccaagc 37200 
taactggcct acacggggct ctagccagat ccactgggtc tcactagcaa agttcctgaa 37260 
ccataatcac tgataatgtt tgaatatatg tctctttcaa atctgatgct gaaatgtaat 37320 
ccccagtgtt ggagatgggg cctgggtgga ggtgtttggg tcacggggca gacccctcat 37380 
ggtttagtgc tgtcctcatg atagtcagtg agttctcaag tgatcaggtt gtttaaaagt 37440 
gtgtggcatc tccttctcca ctctctctgt tgctcctgct cctgccatgt gagatgccta 37500 
ttccctcttt gcctcctgcc atgattgtaa gtttcctgag gccctcccag aagcagatgc 37560 
cagcactatg ctttcctgta cagcccacag aactgtaagc caatcaaacc tcttttcact 37620 
acaaattatt cagtctcagg tatttcttta tagcaatcca agattggcct aatacagaaa 37680 
attgatgcca aagaagtgca gcattgctat aaagatacct gaaaatctgg aaggggcttt 37740 
ggaactgggt aatgggcaga agttagaaga gtttggaggc ctccaaaaaa agacaggaag 37800 
atgagggaaa gtttggaact tcttagacac tagttaaatg gttgtgaccc aaatgctgtt 37860 
tgtgatgtag acggtgaagt ccaggctgct gaggtctcag atggaaatga ggaacttatt 37920 
gggaactgga acaaaggtca cctgtattat accttagcaa agaacttggc tgtatcacat 37980 
ccatgaccta gggatctgtg gagttttgaa cttcagattg atgatttagg gtatctggtg 38040 
gaagaaattt ctaagcagca aaacattcaa gaggtactag gctgattcta acaatctatg 38100 
ctcaggtgtg ggagtaaagg aatgacttaa tgttggaatt tttatttaaa tgggaagcag 38160 
agtgtaaaag cttggaaaat ttgcaccctc actatgtgat agaaaagaaa agcccaattt 38220 
caggggatga attcaagcag actgtggggt aaccacttgc tagagatatt tgcataacta 38280 
aaaaagacat ttcagagatc taagaggcag cccctcccat cacaggctct gaggccaagg 38340 
tggtttcatg gtccaggcct agggccctgc tgccctgagc agcctctgga cactgcttct 38400 
cgcatcctgg ccactccagc tccagccttg gctgaaaggt ccccagatac agctcaggtt 38460 
gctgcttcag aaagtgtaag ctgtacgcct tggcagcttc tgtctggcat taagcctgta 38520 
ggtgtgcaaa gtgcaagagt gaaggaggct tggcagcctc tgcctagatt tcagaagatg 38580 
tataggaaat ctgggtgccc aggcagagcc tgctacaggg gcagagccct catggaaaac 38640 
ctctgctagg acagtgcaga ggggaaacgt ggggttggag cccccatgta cagtctccac 38700 
cgagggcact gcctagtggg gctgtgggaa gggggccccg gtcctccaaa cctcagaatg 38760 
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gtagatccac caacagctcg caccctgcac ctggaaaagc cacaggcact caacaactgt 38820 
gaaagcagct acaggtgccg aacccagaaa agctacaggg gcagagctgc ccaaggcctt 38880 
gggagcccac cccttgtgtc agtgtggcct ggatgtgaga cctggagttg aaggagatta 38940 
ttttggagct ttgacattta aagactaccc tgctgggttt caaacttgca tggggcctgt 3 9000 
agccctttat tttgcccaat ttctcccttt tggaatgaaa atgtttactc aatgcctatc 39060 
ccccccatag catcttggaa gtaaataact tgttttttat tttacaggct cataggttga 39120 
aggaactcat ttccagatga gactttggac tttggagtta atgctgaaat gagttaagac 39180 
tttgggggac tattgagaag agaggactat attttgcaat gtgaaaagga catgacattt 39240 
gagggaccaa ggtggaatga cgtgcccctg ccaaatctca tgttgaattg taatcctcag 39300 
tgttggaatt ggggtctggg gggggatgtt tgagtcatgg gggtggatct ctcatggttt 39360 
ggcgccgtct ttgcattatc aagtgagttc tcacaagatc tggttgttag cctgaccaac 39420 
atggtgaaac cctatctcaa ctaaaaatac aaaaattagc ggggcatggt ggcatgcgcc 3 9480 
tataatccca gctactcagg aggctgaggc aggagaatcg cttgaaccag ggaggcagag 3 9540 
gttgcagtga gccaagatca cgccattgca ctccagcttg ggtgacagag cgaaactccg 39600 
tctcaaaaaa aaaaaaaaag tgtatagcac cttcccttca ctctctctct tgctcccagt 39660 
cctgccatat gagatgcctg ttccctcttt gccttcttcc atgattgtaa gcttcctaag 39720 
gcctccccaa aagcagatgc cagcactatg cttcctgtac agcctgaaga actgtgaacc 39780 
aattaaacct cttttcttta taaattaccc aatctcaagt atttctttat agcaatgcaa 39840 
aatggtctaa cacaatcacc aacagatttc acaagaagaa ggaggagcca tgggcagctg 39900 
tgagctgctg aaaaggagct gaccactgga tccaccaacc acagcgaagg acgtccaacc 39960 
ctcgccctga ggacagaaac tcccttctct gatgcctggc tgtctgggtg aggtcttggg 4 0020 
cctcatagca ccaaagatca aatgccatga aatgagctct ttccccctgc tgtgtgacca 40080 
caccctacac acacacacac acacacacac acacacacac acacacacac acagaggcag 4 014 0 
ggtagctttg tatcaaagca gggcagatgt gatggatgag gcacccaaca gcaccacagc 40200 
agccttctgg ggctggaaaa ttgccctcca ttcagagtat tcctttcggg ttaaggatgg 40260 
ggaggctagg ccacacaatg gtggggtttt gcatcaactc cacaaacttg tgctgagccc 4 0320 
actctttgct gggctctgtc ctggatagga cctgttccag gtctaaacat ttataaaaag 4 0380 
caagctcctg cccttgggga tgggtgacct acgacagaag agttgttgag aaagaaaaag 40440 
gtcaatgtca gggtttgaaa acagaagctc ccataaaatg ttatgggagt ctccaagatg 40500 
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aatttgaggt gatgagggct tggtagacaa gacgaatcct gagctggact ttgatggatg 40560 
gagagaatgt ggacaagaag agatgaataa tgttagagag gaagaccttg gccactaagc 40620 
tgaagagatt gaattctgcc ctgagggcag tgggagccat ggaggatctt cagattgggg 40680 
tgactgtgag gtccaaggga cagagcatgg gcttttaagt tccccagacc caggtctgag 40740 
tcccagctct gccacaggtc agcttggacc tgggcaaagc cacttttttt ctccaagcct 40800 
gtcttctctt ctgcttctga gagccctgcc tatctcccag catgagatgc tggaacctga 40860 
tgggagggta taacgggaat tcggaagcag caatcacatg ctgtgcccac aaggccaaga 40920 
gtccccagtc cctcagcctg tgaggtcaca tgaagagtta gattgggcat gtgccagctc 40980 
aaaagcctcc ttcccacctg cagaactgca gaacctcagg tatcatttca tgatttcatc 41040 
ctcagtaact tgggacatcc ccaggggatg ctctgcagtg ctcatagccc ccttgggtct 41100 
ctgactttcc tattgccacc ggctattcca gtccatggcg ctgtccacct gtgcttcctc 41160 
aggctgcttt tcacattatt gccgccagtc tgcccctaga ctgctgtcct tccctgtgac 41220 
cacctcctgc aatgccatct ctgcacctct actgccaaga ttcactccac ctctctgatt 41280 
cctggccctc ctccaggttc tacccttccc cggggctccc acagcaaagc tgtctgtatc 41340 
cctggtgcta gagttccttg cttgctcgtc actttatatg cattttctca ttcacactca 41400 
tgacattagt tgttattgtg acctcagctg cgattcagga cacctagtgg ttgtcacccg 41460 
gtgggagaac atgtaagtca ggaaggtgta ttctgatgga agtcaaggag gcattggtgc 41520 
ccagagaggg caggctcagg ttggaagcct ggggtggcat gagggtgcaa aataagaggc 41580 
acagacagga ggtactctag agaagatcac ctctgatctc agaatttggc ctttaggcta 41640 
gctttggacc aaaaagtaaa acaagcagaa gtccactgcc tgagtaggcc agagcgagac 41700 
tccagtgatg catgcatctt ccaaacaact ttgatgcaaa gactcggcca ttcaaccctc 41760 
cccacgttga atcatttctt tccacttcaa gcctgttgcc aactcgggtt gaagtctctc 41820 
aactgataag agtaaagaca gtggtttcga gatagggaga taaggaagga gctgatgaag 41880 
gcaagtgaag ttatctttgc caccagggct aaggctggga cccagaggta tgaaaccaga 41940 
gggctgagca ttatgtgggg accatccctg ggcagagtcc ggaagatggg ggtggaatcc 42000 
ccgtttagtt tgtttgcttc gggggttggc ccttcagctc agctgatctt tgctctaaca 42060 
actgctattt cagcagtgct ggatgccagg gagagaaaga acaaagtaca gatgagactg 42120 
gctttattta gttctagtat gtgaactaac aaacatttgt agaacattta ctcttgtctc 42180 



90 



BNSDOCID: <WO 9911799A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/1 1799 PCT/US98/18426 

tgcgaggcat gttctgagag catcaccaac attagcccat tgaatcctca taacaaccct 42240 
aagaagctag tactattatt atgccaactc acaggtgagg acactgggga ctcagggaga 42300 
taaagtcact tgcccaagat cacacagcta gtgggtgcca gagctgggat tgaaactcag 42360 
gctgattggc tcccaagtca gtgtttcaag gctcagttca ctgacagaaa gacaggacca 42420 
tgtttgtcct caaaggctat cttcaggggt cttcccctgc aaaatcccat cagctctacc 424 80 
tgcttgcttg cccttcagga tggctactgc agcaaacaga ggaggcagga gtctttgagg 42540 
gggcattttc cagtgctctg gtctctggga ataatttcat taccagtcct ggaaggggtg 42 600 
ggagagagga gcctggggaa aaggctgcgg gagcttcagc agttattgaa tgacagcatt 42660 
ctcatgttct ccttgtccct ggtctgtcct ctgaggctgt gggctcctgc cctctgcttt 42720 
tattcctgag aatgaagttc actgggccac ctcactcatt ttgactctgc cccttcagag 42780 
actcccatga aggtccgttt tccctcttca aaatggccta ttttgtcatt cttgaatgga 42840 
tggttgaaag ttgctttaca cttgatgtgg acataagcag actgtaaatc cattaaaaat 42900 
agaaacaaaa ggaatggtgg ccaggcatgg tgacacgcac ctgtggtctc agctactcag 42960 
aaggctgagg ctggagaatc acttgagccc gggaggctga ggttgcagtg aaccgagaac 43020 
atgccactgc actccaccct gggcaacaga ataagaccct gtttcaaaaa taaaaataga 43 080 
aacaaaagga atgggcagtt tgcctctggg gcctccaggc taggtgagca tccatcccca 43140 
ttgggtctgt ttgtctctct aaagcattcc tctctgggaa atcaccttcc tcgaacccaa 43200 
agagcaaggg ttctgtagag catttactag gagctgcctt gaagctgccc tctgattcag 43260 
ctgggcgtga agacaaggca ggcctcagct tctccatttc cctcattggg agttgtgttg 43320 
tttacgagtg actcacactt cagggtaacc aaatccctgt tctagaagct acagcagaaa 43380 
cagcatttcc agttcaccaa attagtcttc tcagagcgcc caaactcccc tttctgtgtc 43440 
ttcttcccta ttcagtggca tcccccaaac cctccagata ggggcatctt tgggctttca 43500 
tttcctccca ggtgtgagtg gctctaacag cctcggttcc ccagagtgca ggttcagaaa 43560 
atgcagggga atgggactcc ccagatccca agggaagagc tggcatcact tcttcagagt 43620 
gggcactcct ggggggactc atcccagctc aagaaagtaa ctctccagtc acaaccaact 43680 
ttcccaaggg gtacagtggg acttgccaag cactacaaga agagaggaca ctggcttttc 43740 
cagtcagctt ggaggaagag agggaggaaa aaaggagggg gaggaagagc aaaaaggagg 43800 
aggagcjagga aatggacaag gaggaaaaaa aaaaaaagga ggaggatgag gagggaaaag 43 860 
gaggaagaag aagaggaagt ggaggagccc agaaaaggag gggaaggagg aaaaaggagg 43 920 
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aggagaagga aaacgaggag gaagaagagg tggggaggaa aaggagcagc agcaggaggc 43980 
aaatctccat ccccacagca aaagcagtgc tggagccaga gcccagagtg tgggagctaa 44 040 
tgggaatcag cttgctggag ggaaggggac cgaattaagg aatggctggg gctctgccgc 44100 
tgagaggggg ctgggaaaag caggctgatt gagaccagct gttgtgcctc tgtctctgag 44160 
atctttggac tctgcccagg atagcctcac accctatcct acacgactag gaacttgcac 44220 
agtccgcctc gggcagccca aagctcctct gcccaccctg gctcccagag ccctccaaaa 44280 
caaaagacca gagaagcact ctccacccag cagccagacg cctccttctt gacgccagcc 44340 
cccaccctct gtctgctcga gcccaggaaa ggcctgaagg aagaggccgg ggaaagagcc 44400 
ctccctctct cccttgtccc tccatccacc cagcgccggc atctggagac cct 44453 

<210> 6 
<211> 45546 
<212> DNA 

<213> Homo sapiens 
<400> 6 

atggcccggg ctcactgggg ctgctgcccc tggctggtcc tcctctgtgg tatgtgcatc 60 
ctagcttcca ctggaaggca gctctgacct ctcccctctg agctcagaaa gggttggagt 12 0 
gagggttggg gcccgagtct ctttttctgt tgcttcctct ctctgacttg aggaagagac 180 
acctcagggc cagtgttggg ggccctcata acttggatcg agtctggttt ggcacctttc 240 
cattcccccc gttatagaaa aaaaatattc tgacactcgt taaaacggta aggaaaactt 300 
aatttaagac taccgcattg caatagagga gagaaatggg gctcaactct gattacagaa 360 
aagacttcca gggatctgta gccaaggagc agagtgaggg ggtcagtgga tggaaaagta 420 
ttaagaggag atagcaaggg tagggggatt cttgctaaac cgactgaaca gcattcttgc 48 0 
tgacggcagg ccagagtgat cagatatcaa gggtgtctaa actgacttag caagattctt 54 0 
gctaagactg ggtgatgcaa gcctggcaag ggcaggacag acacagaagg ccaaggtcaa 600 
ggccaagtgg agaagaggct ttgggggagc ctaactaaac tttggtcagg aagagagtct 660 
ttcctctttg tcattcctgc ggcaccctct ctgtcctcaa ctaggtgccc agcccagcgc 72 0 
tcccacccta gccttccttt ctttgttgtc tgtcagcagg gcctgatcag tcactcaatg 780 
tccagttcct gagcacctac ccagtgccag gccttgtgcc agggaccaca gagtcattca 84 0 
gctgcagagc ctgctcttgg gagccacagc cctggcctca gagaacagca tattctggtg 900 
tccagagaga cataagttgg ctgtgtcctc attcttatgt tagcatatca gtgccaacat 960 
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tttgccaggg atctgtttct acagaaatgg gtgtttgttt tcttaaatga acagtatggg 1020 
tcgcatctga tccttgagtt tatgagacca agataaaatc acacaggacg ttcatggtgt 1080 
tgaaccccag ccaccctcct tgcatctgta gctcagccca ctcgaaagtg tggctgggcg 114 0 
tggtggctca tgcctgtaat cccagctctt tgggaggcca tggcaggcgg atcacttgag 1200 
gtcaggagtt cgagaccagc ctggccaaca tggtgaaaca ctgtctctat taaaattaca 1260 
aaaattagcc aggcatggtg gcgcacgctt gtaatcccag ttcctcaggg ggggctgagg 1320 
caggagaatc gcttgaactc aggaggcaga gattgcagtg agccaagatt gtgccactgc 1380 
actccagcct gggcaaaaga aggagactcc atctcaaaaa aaaaaaaaat gcgggcgcgg 144 0 
ggggcgttgg gcgcagtggc tcacgcctgt gatcccaaca ctttgggagg ccggggcggg 1500 
tggatcacct gaggtcagga gttcgagacc agcctggcca atatagtgaa acccccatct 1560 
ctactaaaaa tacaaaaatt agctgggtgt ggtgggtgca tgcctgtagt cccagctact 1620 
tgggaggctg aggcaggaga atcgcttgaa cccgggaggc ggaggttgca gtgagctgag 1680 
atcgccccac tgcactccag cctgggtgac agagagactc tgtctcaaaa aaaaaaaaaa 1740 
ggaggggggg cgggggagtg taatgtctcc ccacactcag ggccccatta ccatctaggg 1800 
aaatccctcc ctaaaagaca ggggggttag aggcaaggac tatcagagac cgctttgtct 1860 
aattcaagcc cctcctttcc agaggtccag agcctggaga acttagtcca gtgcctggca 1920 
cacagtaggc gcttaggaga tggttgctca atgaatattg ggggggggtc actctcgagt 1980 
cactctaaat tgggggcaga gcccaaacta gaaactagat ctccatatat tctgtccctt 2040 
ctgtggcatc acctttttca ggcccacctc tcatacttgg cagagtggga ggactgaaag 2100 
tgcgctcacc tcttttgtat taagatggcc tgggttcaaa tccctgttct accactgact 2160 
cctggtggca caagttagac aagttaccaa atctatctgg gccttgactt cctcctctgc 2220 
agaaagtgaa caataatttc cacctcatag tgatgtcaca aatatcaaat gagatactgt 2280 
agatgaattg ctgcgttcag tatctggcac ataatagggt gtctttaaat actaattaat 2340 
ccatcttt cc ccaccatggc agactaggat ttctgtctaa tatattgaaa atgagcctgg 24 00 
aatgataggc aaagatttgg actgtaagaa acagtgcaag caggccttga attgaagtca 24 60 
acccaatgta tcatacaatt agatttataa aaggctaaaa tgaaggtggg gtgattattt 2520 
agcgtgatct tctagagcat ttcaaagaaa atttgataac aaaaatttaa tttacacaca 25 80 
acttttctac ttcactaaac cctgaaaaga cccaattaga ttagaagaaa aattgattta 2640 
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atgagcacta aaacccatcc aaaagtggca atcacttgtc tatgaaacag atttctttga 2700 
ttgtgaccca gggtgctatc cagttcagcc tgatttccta tgaaatgtga ttatttttat 2760 
cctcaagtag agctggccag cttcgtgcgt ttcaaaggat ggaggccccg tggggcgggc 2820 
tgagaggata ctccttccct ctctaggaac tagctggtcc ccaaatgacc ctggattttt 2880 
ctcccggctt ctagcttgtg cctggggcca cacaaagcca gtggaccttg gagggcagga 2940 
tgtgagaaac tgttccacca acccccctgt gagtgcccct tggcccgggg gctgggccct 3000 
tgtggcccca caggcacccc cccacccctg ccccacgcac ccctgcaccg cagcaccccc 3060 
gccctgctct atcatcttct ctatccctca ttgggtcagt tggtagcccc tcctcctacc 3120 
aggctttctc ctgatttccg gactcttggg cccctccccc ctccactccc tcccctccac 3180 
atcagtgccc tcccctccct cccattcatt cccccagcag gttatctgct tccgtcctga 3240 
tctgggcctg ggaaacccct ctaggaggag ggaggagggc aggaggcccc taactggcct 3300 
ttggctgagg ccaggcagag gaaagaaaag acaaattcta gagtgacagc aaaaagagaa 3360 
gcacagaagg gaagagggag gaagacagag gttagaagag ccaacggagg ccgagcacgg 3420 
tggctgacgc ctgtaatccc agcactttgg gagaccgagc caggtggatc acctgaggtc 3480 
aggagttcga gaccagcctg ggcaacatgg tgaaaccccg tctctactaa aaataaaaaa 3540 
attagccagg catggtggcg ggtgcctgta atccccgcta ctcaggaggc taaggcagaa 3600 
gaattgcttg aacccaggaa gcagaggttg cagtgagcca agattgtgcc actgcactcc 3660 
agtctgggcg acagagcaag actctgtctc aaaaacaaaa caaaacaaaa ccaaaaccaa 3720 
agaagagcca acggaagggc aagagaggga ggcctagggt gtgcagacga ggaaaaggcc 3780 
cagcagaagg ggcttgggta gtcagcttcc tagctggaga ccagagtgat tctttctaca 3840 
atcagagtag ggctaagggg taggcatgcc agcagcactg taagccatgg gaaaaacgca 3900 
gtacatctgc tagcaatatc aactttatgg gacccctggg gcaggcaggt gggtgggcaa 3960 
accacattac ttttatatta aaacaaacat ggatatatcg tggagtagaa agcaaaacag 402 0 
cacaaacgga aacatgcttt tcgttttaaa gctaaaatga cctgtaaaga aatagagaac 4080 
tggtctggaa agggtctttc tgggcaatat ctcaaactgc tgggggcatt cattcactct 414 0 
cctctgccat ttgggaccag tgggtggcaa cctgaaaaag ccctgccaag ggaaggttgc 4200 
cagataaaat acaggacacc cagttacatt tgaattttag ataacaaata agaatttagt 4260 
gcaagtatgc cccaaattgt gcatgtttta atataatact tacattaaaa cattacacat 4320 
tgtttatttg aaatgcaaac ataactaggt gagtgtcctg tatttttatt tgctaatctg 4380 
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gcaaccctac ccccaggtag ccaaagaagg actgtctgca tggcctggcc caatgccaga 4440 
gctcagaccc ctgcagttct cacccacacc catatcccag actcaggtcc cattactgtc 4500 
acaaggggtc cctttccaaa ctctccaaca cctccatctc cacccagccc aggcatctta 456 0 
atctagacta agcagtaaca tcagttgcca ccccatccct tcccaagcca ggccagccta 4620 
acttccccca accgctgctt caggagaggc tgggggctct tctggaggcc aggggcccag 4680 
atgtcatttt ctctaagcaa cgtgtgtatc tgtttgtttg tgtctcaatg tcttcttgtc 4740 
atctgtcagt accttccagt tactgtggtc aataccacaa tgtcactcac agccctccgc 48 00 
cagcagatgc agacccagaa tctctcagcc tacatcatcc caggcacaga tgctcacatg 48 60 
gtaagagaca gcttctctcc cccttgccct ctctgctacc ctgggtcaga gaccacaaac 4920 
aggagctgtt aaaactcaga agatgaagac agagaaagga tctgatgggc aaagggaggg 4 980 
aagtagagag catggacacg gtagttttag ggcaagtgaa agaatgtatc cctctacata 5040 
ggcgggcaga cattgccaaa cctccttaac ctcaagaaat taaactataa acaacttcag 5100 
aatcaatttt catcaattcc cagaaggcag atccagaatg aatgtttagg gtatccctgc 5160 
cctaggaggt tagaaggtgg gctggaaatt gggctggaag aactccaagc tcccttctgg 5220 
cccaggcctt cccatactct gtggaatgga gtagcaaaat aatccaaaat cctgggttaa 5280 
gggagcgcgg ggttggttgt ggaggggggc acccatccca ccgtttcccc tcactagctc 5340 
caccctccac accatccacc tttccaacac cagcccattt ggaagagagg gcatttttag 54 00 
tgtcaatgag cttgtttaaa ttaattttcc cacatgctgg gaagggctaa gaacgttggg 54 60 
taagtatctg aacttccctg agcctcagtt tcctcttctg caaagagaaa tgggtctgta 5520 
atgagatcta tctccaagga ttatgaggat caaaatatga taatgtgtaa agccctttgc 55 80 
agaagttatg atgttcccag tctactgtgc ttgactcagg agggagaagg attagctgga 5640 
ggggaaaacc caaacgagcc aaaagacagg accctagact gcatagaatt tgcagggcag 5700 
ggcaacaagt ctcctttgca gaacagtcca gtgttacact gagtccagtg tgaacagtgt 5760 
gggggcgggg gagttgaact gcaaaagata cggttgctca aaggaaggga agatattgaa 5820 
ggctgggaga gttacagagg gcttcctgga agaggcactt ggtgtgcttg ggcttgtagc 5880 
tgacaagggg ttggagccga cagacagaaa taactgggat gtggttgggg gcctttcatg 5940 
tgggatctga taccacgaaa aaggctaatg atggtgttgt tgatttccta gaacgagtac 6000 
atcggccaac atgacgagag gcgtgcgtgg attacaggct ttacagggtc tgcaggtgac 606 0 
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aatcattacc cagccccatt gcttttgttg gtagatccag aggtggtcac agaggaccta 6120 
atgtggctag tgtctcagca tctgggaccc cagaacctac tgtagagaaa acccttctac 6180 
tctctctgtc tccctccacc accccaaaac catcagattc cccagggcac atatctcata 6240 
gcccaccagc cactttctgt gtagaaatga ggcagaggct gccttcctgt gctcattacc 63 00 
tctgctgcaa ccaggcccag tccagcactc ccaggctccg tttctaaata gttgctactt 6360 
ctacctctaa ccgctaagaa cccgctgatt cctttcacat gagggctcat ttggagacaa 6420 
agttttcctt tgccggttgt tttacggaca acatcacttc acatggccaa atgagacaca 6480 
aacatataag cccttgatga gatcacagtg tctgaagggg cctcgcaccg tgatcctggc 6540 
ccattgaatg aaatccaagc ttcctttcct gatcctcaaa cattccccct tgggcttcaa 6600 
actaccctgc ctcgtgcctc gtaggctctt tttccctttg ccaaagtctc tggattattc 6660 
ttgcccagca gccctcagct gagcctctgg tttagcctcc ccatcagcca ccgagtcttt 6720 
acgctgagcc ccatctttcc tgagagcgcc tacatgcagc caaaagttga cctcacttct 6780 
gctgaaagtc cacaagcagc cctcaacaca aagcagaggt gcctgattca ggacaccttt 6840 
ctgccagctc cccccacagc tctttttaag atctccttcc tcacttcttt ccaatggagg 6900 
agagaatctc tttccagagg ccccttgtgg cattctcaga gccagcactg cattgcacat 6960 
ccatcagcta atgccacgtt cctccccttc accctcacct gcaagtttct ctgttctgcc 7020 
ccaggaactg cagtggtgac tatgaagaaa gcagctgtct ggaccgacag tcgctactgg 7080 
actcaggctg agcggcagat ggactgcaac tgggagctcc ataaggaagg tagaagggcc 7140 
gcatggattt gttccccaag tcttgggacc tggactaggt tcaggaaggt ataggtgaga 7200 
gcgtgcatgt aagaccatgc tgggcctcta tggggagctt aggaaatctg aggccatcac 7260 
tgactttcaa ggctgatctc aaggaagaca cacatggtag gaccatcaga cagaaccccc 7320 
ggctggagag cctggggctg gccctgaagg tgacctctgc attgcttcct atctttcttt 7380 
cagttggcac cactcctatt gtcacctggc tcctcaccga gattcctgct ggagggcgtg 7440 
tgggttttga ccccttcctc ttgtccattg gtatgctctt ccttcagtcc ctgaatttgt 7500 
ccatgctaac gagggtgact cagcttccct aggatataaa gaaatggacc ttggtagaag 7560 
gaggggcggt gggactataa agatagagca tttgaaattg tagttgcaga atgttttgca 7620 
atgaggatgt attcatggat tgtgtaatta aaatacattt aaaagaatca gtacaaatat 7680 
tttaaaattc atgagtgcag agacccaacc tagaagactc agtgaatcat taaagagcta 7740 
gtggatgaat aggccaggat gtgcttgaac ccagaaccat cctctgcgcc ctggaacccc 7800 
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gccaatgatg atgctggaat cacatggtaa 
aaaagagttg ggagggcttc tgagtttaaa 
ggagttctgg ggcctcagac agctggcaag 
tatgtactga aaggcttgtg ctttagaact 
aacatcacca ttccttttga ctgtgacact 
agaaggacca cccacagcca ctggcaatga 
cagtgttaga gtaggctcag ggcattaggc 
ccccagcgtg ggcatacatg ccgggggtgg 
ggcagctggg ctgcaaaggg aggcaaaggg 
tttttcctgg ggtgtaaatg ggcaagggac 
tgggagagtt atgatctggc cctccaaggc 
aatcttgtgg acctggtatg gggatcagag 
gccctgcagg aggcattcac aggtgattca 
aacgcaggtc ccggctgcta ttccgtaggt 
agtggtttgc aaaccttagc atgcacctga 
tttcagggcc ccctcccctt gaggagtctg 
aatgttgggt taaagaatgt taggatttga 
gggaaacaga gaaagatgag agaaaatatc 
tatgatttgc ttgcctggtg gcagagactg 
ctgaccctct tgtgatgaag aatttagatt 
agtggacaga ggaagtaaat aactaagttc 
aggttttgct ggaccaatca ggaaggttag 
ttgacactat ctaaatccat cacaaaagaa 
actaaagtcc tgaccaggac cagcctactg 
gtcctatctc cctgggaatt ctgaatgagc 
caagccaaga gatccctttc cgcatgcaag 
aatacagcag gagggagatt tcaaaatagc 
catttttctt tatctgttct gatgtaactt 



ccctcttctg gaaggtagct tagaagggtc 7860 
aaaaaaaggg cccctgagaa tacaactccc 7920 
cccattcccc caaaggggag tcctcatgaa 7980 
ggcttctctg ccagcctgtc caatggcctg 8040 
gaagccctga taaaataatt agtgcccaag 8100 
aaaagacaag cttagccaat ttgtggcagt 8160 
ttggaaagca gaaagaggat gagaaacata 8220 
ggggtgggca ggctgctggg gcaggctccg 8280 
aaccaggact aactttgcct gaatcacaat 8340 
aagtgactcc ttcttgtctc tgcagacacc 8400 
tctaacagac agctggtgtc catcacaacc 8460 
aggccaccgg ttccaaatca acccatttat 8520 
gtaagcccag tttccttccc aacttgtagc 8580 
ggcaaacttc atcatcagag gtaccaaagc 8640 
gtcacctggg atgcttgttt aaaatgcaga 8700 
aatcaaaatg tcctccggcc atatttagaa 8760 
aacaagatgg agcaggaagg gaggaaagaa 8820 
caaccccttt caagagagaa aaaatcatga 8880 
atccataatg aatcattcag acctttactt 894 0 
tgtgagatgg ctggggtcag gtgggccagg 9000 
agagtcattc tgtagcacag attacagttc 9060 
aaaaatataa catcaataac cttgaccccg 9120 
cagtatgttg taagattcct gtaagtcgta 9180 
attatgaatc agtgtcttgg ccaccaacca 9240 
caatgaactt ggattcctag taacattgat 93 00 
tccacataat ttctcactta gaaaggagcc 9360 
agccatgacg aaacaagtga aaattgtgtc 9420 
ccagaatcat tctttctgca tcttgacacc 9480 
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atttttaaag aaatttgtta tccaagactg ttaaatcata aataaaactc atcttattca 954 0 
gataaagaaa gtcaaagtta tgtcgggaag gggaagggaa aaaagaaaaa ccttcctggg 9600 
gctggaaggg ttttgtcact tcccaaaagt gaggggatgc cgagtcatca cccgctgcca 9660 
gtcttcaaga aaaaggcacc tggagcttga gtttggaagc cgaagaacag agttcagctt 9720 
caataagctt ataaagacga gagagacttt ccagccactt gcagatttta aaggagaaag 9780 
gagagtgcaa agggcagaac atgaaaagtt cgctactcaa acatacctgt acgcaccaga 9840 
ccattgaaat ttgggttcat ttggtttctc agaaaggtat catattcctg aaccttgaaa 9900 
acataattgc aagtgaaaga agccagtcag agaaggccac atattgtaca attccatttg 9960 
tatgaaatat ccagcactgg caaatccata gagacagaaa atagattagg gatttcctgg 10020 
ggcccagggg aggggggata gggagtgact ccttagtgag tacagggttt cctttgggga 10080 
tgaagaaaat gttctagaat tagatcgtgg cgatggttgc acaacattgt gagtatgctg 10140 
aaagccactg aattgtgcac tttaaaaaga ttaaaatgat atgttttatg ttatgtgtaa 10200 
aatacacata acataaatta tgtgtaaaat atgtagtaaa ataaacctac aaaaatgtat 10260 
tgtacccaaa gctttggtca ttgttgaata gttttacctg ttcacatgaa atatttttca 10320 
cgacattaag ttatttatgc tcatatctga acccaactag gacccttaat gtttatggcc 10380 
ttcggccagc aaactcctag tcctgcctag gggtagactc tttgaatatc ttgcaggggg 10440 
cagggataga agatgaaaac taaagtcata gaacccagat ataagcatct gagaaaggac 10500 
ggtgctgtga cacagagttt aaggacagat ttggagactg agaatgttat caagacagta 10560 
cggtgtctgt acagattaag agcgtgggcc atgcagtcag acagaccaga gttcaaatcc 10620 
cagctctzacc tcttgcatcc atgaaaccct agacaagtca ctttacctct ctgaagcctc 10680 
agtttcctta tctgtaaaat gggatgggta agaattagaa caatgcacat ggagtgctta 1074 0 
gcccagtacc cgtggcttag taaatactgg ggttactgca gagaaagttg aacacggccc 10800 
cagccctacc cctttgtcct tcaactagga ctgctctctt gatctcagtc tgcatgatta 10860 
tttattaatc agaaagatac aagagttact acatctcaag ataccattct gtgtgcttta 10920 
caaatattaa ttcattccat cctcatagca gccctattga ggcagatgct tattatzactc 10980 
catccccatt ttacaaatgt gcaaaatgga acacagaggg gttaggtaat ttgcccaagg 11040 
tcacccatcc agtaatggca aagccaggat ttgagcccca tctgggtcca gggcccatgt 11100 
cattaatgag tacacaagta agagtttgtt tgaggaaagg gtttcgctgc tttttaagag 11160 
gacgcaaaac tgttacccta aggctgacct tccaggaacc gagtgccaaa ggcaaggtct 11220 
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gtcacttaca cttttttggg gctcctttgc ttctagggag cacttggcag gagaaagtat 11280 
ctggcgtccg aagccagatg cagaagcatc aaaaggtccc gactgccgtc cttctgtcgg 11340 
cgcttgagga gacggcctgt gagtgtggat ttgcagacat gggtgggcgc ctgggtctcc 11400 
ccaatgcccc aagcctcccg ggccctgcag cacagagcta gctcttcccc aaataaccat 11460 
gtgcccacca attcttgaga actccatggc cacagcccgt gggaaccaag aagggtaagg 11520 
gggcaggagc tcatggcttc agaaaaagga caaaaggtag ccctgagcaa gctgggacca 11580 
gcccacaggc caccagagga acgaactgac tatggcagca aagtgcttct gtcccactaa 11640 
ggggaggagg accaggaaat gcctcccaac aacttagcca ctaccccagg gcactcacac 11700 
tcccagcttt agcacccagt gggactctac tctttccgac atggctcagg acttcttgtc 11760 
tgatccctat ggctgggctt cagatattcc agggacccac tggaactgtg tgctgactgc 11820 
taagtatctg catttttctg gggggaaaag atccataact ctcaacagca tctcaaagac 11880 
gcccactccc ccaccccccc cccccggcaa aaagattgag taacatgcag aggtagatga 1194 0 
tacaagcatg ctgttggtaa tctttggagg ccaaaggaaa cagtggggca aaagtgtcag 12000 
aaatgcttgc ctctggccgg gtgcggtgac tcacgcctgt aatcccagca ctttgggagg 12060 
ccaaggtggg cggatcacct gaagtcagga gttcaagacc agcctgacca acatgttgaa 12120 
acgccgtctc tactaaaaat acaaaaatta gctgggcatg gtggcaggca tctgcaaacc 12180 
cagctacctg ggaggctgag gcaggagact tgcttgaacc tgcgaggtgg agattgcagt 12240 
gagatcttgc cactgcactc cagcctgggt gacagagcaa gatactgtca aaaaagaaag 12300 
aaaagaaaga aaaagaaaga aaggaagaaa gaaagaaaga aagaaagaaa gaaagaaaga 12360 
aagaaagaaa gaaagaaaga aagaaaggaa ggaaggaagg aaggaaggaa ggaaggaagg 12420 
aaggaaggaa gggaggaagg aaagaaggaa ggaaggaaag gaaaaaaaga ggaaaagaaa 12480 
agcttgcctc aggcagatca gcattaaata ttccttgtca attctcttcc cagggctctt 12540 
caaccttcga gccagtgaca tcccctataa ccccttcttc tattcctaca cgctgctcac 12600 
agactcttct attaggtatg gcttttcctt agcttgctgt tgtggacttt ctccaacttc 12660 
caccctcttg atgccccacc actgatcccg ccttaatata cagccctctg gctgcccatc 12720 
agctcggcgc ctgctgcagc acgacccttt agaaaacccc ctgttgtctt ttcctgactc 12780 
tttaaacctc tgtccctatt gaatcccaaa tctggcctgc ttggctccct ggggctggct 12840 
tcctttgacc tccaggaaca gagggactgt gactgcctct ggtcctttgc atccttagca 12 900 

99 

SUBSTITUTE SHEET (RULE 26) 

9911799A2 I > 



WO 99/11799 PCT/US98/18426 

gatgctcagg accctccttg tcttcccacc ccacccaggt tgtttgcaaa caagagtcgc 12 960 
tttagctccg aaaccttgag ctatctgaac tccagttgca caggccccat gtgtgtgcaa 13020 
atcgaggatt acagccaagt tcgtgacagc atccaggcct actcattggg agatgtgagg 13080 
atctggattg ggaccagcta taccatgtat gggatctatg aaatgatacc caaggtgggt 13140 
ttgccaggcc ccagcccaag ccaggcacca atccccactc taggcctgag aagtcttaac 13200 
ctaaagtgag gtgaagcccc tcagccattc agtcatctgg tcagccaaca actgtgcagt 13260 
gagttccctt cttggcctaa ctgggaagag ccagaagagg aaacagggat ggccatccag 13320 
agagctcata aaattgtcag gaggctggag agtatacggc ctgatctagc agagactttg 13380 
gagtcagcca aacttgaaaa tgaattccag ctctgtccct aactagctgt gcagtgactt 13440 
tggacaagca acttagccaa tctgggtctt tttcttcacc tataaaatgg caataagagt 13 500 
cctgacctcg caggagtcgc tcagtggaga tctaaaggag atagtgtata tgactgacac 13560 
atggtagtgt ccaataaatg gtagctgtca tgatatgtac aggaaactta cactgtcagg 13 620 
acagcctgtc gagtgactca ggccagactc tgatccatcc cagagtgctg gtgggatccg 13680 
aagacaggga gttcattgcc aagcaaggca gtgtcaatca aggaagcctt cctggaagga 13740 
gagagggttc agctggacgt ggaaaactag gaaagacaga aagcacacac agagtagaga 13800 
gcattgccac gaaatggtgg caaatctcac gtctgctggg tcctcctgcc agtcctctga 13860 
aaacacccct gtgccatgga gacctccttt actgtgcccc cagacctgtg cccccagacc 13920 
tccttttctc ctccctgccc ccaacaggag aaactcgtga cagacaccta ctccccagtg 13980 
atgatgacca aggcagtgaa gaacagcaag gagcaggccc tcctcaaggc cagccacgta 14 040 
agtccacgtt caggcagaca tggccttttg ggagtatcca gcctaatgag ttagaaagga 14100 
aaggcctcca gaagagcctg aagaagaccc tcaatgaaac aaagaagcaa gctgggcgcg 14160 
gtggctcacg cctgtaatcc tagcactttg ggaggccaag gtgagaggat tgcttgagct 14220 
caggagtccg agagcagcct ggccaacatg gtgaagccct gtctctacta aaaatacaaa 14280 
aattagccag gcgtggtggc gatcacctgt agtcccagct actcaggagg ctgaggcacg 14340 
agaatcgctt gaacctggga agtagaggtt gcaatgagct gagatcatgc cactgcactc 14400 
cagcctgggc gagactaggt caaaaaaaaa gaaagaaaga aagaaaaaga aataaagaag 14460 
cagacaccaa attattcctg ggcgcccaca tgcacctcat atccttgccc ccaatatagt 14 520 
acagtggttt agagcacagg ccctggagct agggtgcctt gagttcaaac cacagctcca 14580 
cccaattcct aactgggtga ccttgaccaa gtcattttac ctctctgggc ctcagttacc 14 64 0 
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ccatctgtaa aatgaggatc ataacagtaa 
aatgaatatg tatattgctc tgagcaccac 
agctataatg atttcctcac actggcaaga 
gttaaaatta agataaacca cctcatttaa 
cacacacaaa gataaagaaa atagcatcat 
taatccttga ctcatggcca ttcttgtctt 
accaccccct ctgctgctgg attcttttaa 
ccgtcaacac agaactcttt caaaaacagg 
aagaattctt tgacatcaaa tagccagtgt 
ctgatctcca tcatcttgga gcctgtggct 
ccctgcttcc ccaactccag gtgcgggacg 
tggagaagaa cgtgcccaaa ggcacagtgg 
agttccgagg gtgaagagcc acggccgtcc 
tgggggcagg gagggggaat ttgtcccctt 
gcgtgctggg tgggggaggg ctctggagaa 
gagttgagca gggggcgttg gatgatattt 
gggccacaac ccaatagcgt cccctagtgt 
ggggtgctgt ggctcagacc aggtcagccc 
gaggattccc gaaggcatgc ttggcccagg 
gatgctct cc caggactggg ggagttttca 
tccgtaaaat ggaggtggag aggaaataat 
tctataaatg acaggtcagg gcttgaggag 
caaatggtgt ccagttgaga ggtagaggca 
tcccgggccc agcctagtcc aggggcccat 
agagaagaac agttctcctc cggacccagt 
gctgccctgg cccactacag gtacttgagg 
atcctgttgt gtgtgtagag gaacagggtg 
cacgttgaag gtccgaggcc cctagccctg 
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tcttacagca tatggttgtt gtgagaattc 14700 
ctagcacatt ctaagtgtta aataagtgtt 14760 
ggatggctca atttttaaaa aagaattctt 14820 
aaaaaacttt gtctcatagg aaattttaaa 14880 
aatctccagt gtcccttcat ccagcttcaa 14 940 
gtctataacc ccatgttcat taccaccaca 15000 
aagtaaatcc tagatagtgt gtcatttctg 15060 
aacacactat gatcatatat aaacactaac 15120 
tttcaaagct ccacatcctg gagccgctat 15180 
gcaaccagac ctcacccggc tcctctgttt 15240 
ctgtggctgt gatccggtac ttggtctggc 15300 
atgagttctc gggggcagag atcgtggaca 15360 
tttttggctg actgtctttt agtgtggcag 15420 
acttagcaga aaggaagatc ctcctcatat 15480 
caaggagtga caggttcttg ggtggtcggg 15540 
cccgccccgc tgaaagcagg acaactcgaa 15600 
ctgcttcagg tacggtctga atctccctgg 15660 
ctggaccact tgagaatacc aggtagggag 15720 
tcagggccgg attgtaggtt tgaggtacag 15780 
gatacagaga gctgggtttc ttcactccta 15840 
gtgtgtctac agtgtgtcct ctgatcttcc 15900 
atggaaccag gaggtgaggg ctgggagagc 15960 
gccatgacct tcattggatt tgattagata 16020 
tcattgacca tgccttgcct tgtactttcc 16080 
tttgaaacca tctctgctag tggtctgaat 16140 
aaaaagaatt ttctagggcc ctgttggggc 16200 
aggggagggg gatgttctgg gacctgagtc 16260 
tgggggctgg gagaggagct cagtgtggag 16320 

101 



BNSDOCID: <WO 991 1799A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11799 PCT/US98/18426 

tagggagttg ccttgtaatg aaaaggcctg agagctggag gctgggggac ctggaaggaa 16380 
gacaccgatg ccattggatg cttttccctg gcttcccatg tgttgcttat gttagctaat 16440 
gcttaaagct cacgagttca catggagaga ggtctgttca ctcagcctga accagccaag 16500 
gcgggcccaa gatacaggct agccaggtgt ttggccctat agaaaacaaa cccctgcagg 16560 
cctgcctgat gaaaagctga ccagctgcct ccgcctggca tgggcacagc agggagaggt 16620 
caatgccctc caagggagtg cctgacatgt cccagaattc cctcccctgg tgtcccctgg 16680 
tgtctcccac gccttcggct gccttgcccc agcccctcct acccactcgg gctagagtgg 16740 
cagggtgggt gctcctcggc tttaccaccc cttgcccagc agccccaccc tggaggtatc 16800 
ctcccttcct gtgctctgct tcttctcagg gccctctgct ctcggaggca gcaggaagtt 16860 
agagacaaag ggccggaggg agcatctggg tagaggccca gctctttgcc gctttaccgc 16920 
gcatcctcgt gcaagcccct tggcctccat aggcctgctt ctccttatgg aaaaggagac 16980 
agctgagctg gggggtctct agggctctcc cagctctcat attctgggct ttaccctctc 17040 
tctcctccct ggggaccagg gcctggctca gccaggtgca ggattaacag acgtgtgctg 17100 
aggacagcag caacggaagc tgagttctct tccagggccc tttgggatag aatgacctcc 17160 
tccagaggga ctggcctgga agcccaggcc ccagaggtcc tcccaccaag gcctcccacg 17220 
tgacccagtg cagggttagg ctgcccttct caacattctc tccccttctc agcccgacca 17280 
aggagctgaa ccgcaagctg tcctcagatg agatgtacct gctggactct ggggggcagt 17340 
actggtatgt accccgacct caccctagcc tggatgtctc tgctcagacc tcctgagcct 17400 
gccaagagtc agccaaagct ttcccttcct gggccacgga ttcttcgtct gaaaaaggag 17460 
agatctggaa tgagccccga acatcctacc cattttcaca catgggggtc cctgcacagt 17520 
gggaaacaca cagaggctgg aatagtggca ggacccaggc agcacctctg tgggaattag 17580 
gaaaagactc cctttgctta agttgctgtt ttttgtgggt tgtttgtttg tttgtttctt 17640 
gacagagtct cactctgtcg cccaggtttg gagtgcagtg gcgcgatctt ggctcactgc 17700 
aacctctgca tcccgggttc aagtgattct cctgcctcag cctcccaagt agcttggatt 17760 
acaggtgtcc accaccatgc ccagctatat atatatatat atatatatat atacacacac 17820 
acacacatat atatatacac acacatatat atacacacac atacacacac acacacacac 17880 
atatatatat atgtatgtat tagtagagat ggagtttcac catgttggcc aggctgttct 17940 
caaactcctg acctcaggtg atccacctgc cccagcctcc caaagtgctg ggattacaga 18000 
catgagccac catgcccagc ctcaagtaaa atgttttcta agtccacgta ttattacaca 18060 
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gtctgagcaa ctatactcaa tgtccctgcc tggccacaca aacacatata ccctactctc 18120 
atccactagt gcacaccctg gacacccccg caccaccaac gtcatgcaga ctcctcagcc 18180 
ccctcttccc catggcactg ttcagggtgg aggtgggggc agatgagaga agcaaccctt 18240 
ttgatgggct acctttgtac agtgtacaac ctgaatgctc agacatgaca gccgtgcaga 18300 
atagactaac ctggccaggc gtggtggctc acacctgtaa tcccagcact ttgggaggcc 18360 
gaggcgggca gatcacctga ggtcaggagc ttgagaccag cctgggcaac atggtgaaac 18420 
cccatctcta ctaaaaatac aaaaattttc cgggtgtggt ggagcatgct tcccaaggga 18480 
aactgaggca ggagaatcgc ttgaacccgg gaggcggagg ttgcagtgag ccgagatcgt 18 540 
gacactgcac tccagtctga gcaataagag caagactata aaaggaagaa agaaagaaag 18600 
aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagag ggagagagag 18660 
aaagagagag agagagagag agagaaagaa agaaagaaag aaagaaagaa agaaagaaag 18720 
aaagaaagaa agaaagaaag aaagaaagaa aaagaatagg ctaaccccat tccttccaaa 18780 
cgtgtgactg atcattcaca ccagagttga gctgagctga gctgacctga aagatttt cc 18840 
cttctcattg aggaaggaag tgtgggaagc gtggggtggg acaaaagaag ggccgtactg 18 900 
aactgatgga ggagcaagga gtcctgtgga aatggcgaat gtgctaacga cagaggctct 18 960 
ttgcttgttc ttagcacagc tctaggggat ccagagtagc aggcgctcag cttagcagcc 19020 
tttgctggtc aggagttcga tgctgtgtgt acttgagatg gccaatgagc aaaggggaag 19080 
tgaatttgtg ggggcgttgc tggaatgggg tgtgctcaga gtagggtgag gccggggaaa 19140 
tagtcacctc acttcctctt ctttccttct gaagaagaag gtcaccacac cccatgaccc 19200 
aaggagaagc cccagggccc tgggaccatt tctcccctct ccctcaccaa ggcatgatca 19260 
caggggccat ctgtggcaac tgctgcagct gggagttggg agtcagacct tgggactgag 19320 
gcttcaccta actggaactc agcaagtcac tgggtcagat ccgtgtttgc catggagagg 19380 
actgtgtgga gaggcagtgc ttgctctagg cctggcttca ctgaggttca gagagggaaa 19440 
cagccgcatg gcctcaatgg cctcatctgt gaactgggga taatgaggcc tcttctacca 19500 
ttcttcccag ggagcagcaa aagtgtccta taaatggtcg acaactatct gaatatgaga 19560 
gtgttatgat aatataatct ctctggacct cagtttcccc agctgtaaaa aggggacatt 19620 
cccctttctg cctcccagga agcctggaag aagccttgct ttcataacca gctggaccca 19680 
aggaaaccca gacctacaga acagccaaaa tgagtgggat gggggaggag tgtgtagggt 19740 
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ggggagtcac acttgtcaga caacttagta acttttaact tgataggttt gcagaggcct 19800 
ttcacacata ggtgctgaag aggggatttt gattatggta gctgtcttct tcctttcgcc 19860 
gtccacccat cccatcccag ccctgccact tgtggaaagc acacagaaag cacatggccc 19920 
cagtccctag ccccaggcat ttggcatgtt ggttttcctt ctccataggg acgggaccac 19980 
agacatcacc agaacagtcc actggggcac cccctctgcc ttccagaagg taagcatggg 20040 
cccagatttc ccctcaccac cttcccccaa gggggcaccc aagcagtcaa gatcccttcc 20100 
atttcagagg ctaaaactga agctcagaga ggttaagtag tttgcccaag gtgacaccac 20160 
tggtgccgga aaacccagtg ctccctgctg cttccccggg gatatttcag gccactgaca 20220 
aggcctcagc ccaagctgag cctcatccta ttctggtaaa tcagtgcaca aatatttttt 20280 
tgttgttgcc tgtgtgcctg gccctgtact cagcagaaag gagaaagaaa ggcatcccaa 20340 
gtggaggaaa acggcacaag tgaagcctcc gcggtggaaa ggagccagag gtgttcagga 20400 
aacatcgagg gctggccagc tggggcagag gcgagggcat tgtctctgtg caccagaggg 20460 
ccctgggggt ggagcagcga gctgccgtca tgctgcatgc agcaggccga ctatggggca 20520 
gacattgtgc tggaaacaca gagaggggaa caaaatgctg tccaggtctc caggagctca 20580 
cagtttagcg gcggagtctg ctgcctctgg ctgtcctgcc caaagtgaca agcctgtgaa 20640 
cctgtgtggt gatggagatt gaccacatcg cctctggaag tgaccagaag aggccactgt 20700 
caggccacct ccctccgctg acagaggaag gctaagctgc caagtcagca aacctcaacc 20760 
ctcatgaaag gccttaagaa aactcagcag ccgtcacaac cacaggaatg agtgtgctaa 20820 
acttccgtga agcatgcacg ctgttggcag caccttctgt acattagtct gcctgcctct 20880 
cccaacaacc ttaggagcta ggtcctgtta tcttcccatc gcagataagg aaactgaggc 20940 
acacagcagt taaagtcgcc tgcccaaggt ctcagagtga gtggctaagt caggattcag 21000 
cccagggagt cacagcctcg gttcaccgcc accaagtgtc tcctgcagat attgtaatga 21060 
tcctcataga ccagccagag ggaagcacgt gatggagccc cagctttggg aaggccctcc 21120 
tcagcctctg ctggccatct tccagaaccc gggcagccac acataacgat gagactcccc 21180 
ccaccctgcc tacacacacc caggcctggg ccctgcaccg tgtatccata ggccaagcag 21240 
ctggatactc ctctggctcc tcctcccgtc ctccatatca cctcttcctc tccccatcag 21300 
gaggcataca cccgtgtgct gataggaaat attgacctgt ccaggctcat ctttcccgct 21360 
gctacatcag gtgggtttca gaaaccagtc tggacacagc ctcaggccct gatttcacag 21420 
gactcagacc atcaccctgg gaggctggtg gggcaaggat tttgtcatca tcccatccct 21480 
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tatgtagatg agaaaatcca gcctagagag 
ggttagcggg agatccaggc cttgaacctg 
ctcagaggtg ggagagatca gtgggccaca 
ggtgtcaaag tgcaaagccc tgacagatgg 
gtaatggata caatgtatgg taagctggag 
actaactcca tccatcagcc ctcctccatc 
ttagcggcct ctcaggggcc cctcttctta 
gcaaggtgga cagtcttcgg taaagccatc 
ggcgaatggt ggaggccttt gcccgcagag 
atgggacagg ccacggcatt ggcaacttcc 
gcactcccca ggccaccccc cttttattat 
tatggaaatg acaaagaccc acaaagatgc 
agggtaagtg aggaatgcca gcagcaggcc 
gcaccagaaa gagggtcaag ccctagtgac 
ccagagccag ctgcatgtgg tgactcacac 
tgggaggatc acttgaggcc aggagtttga 
tctctacaaa agaaatgttt taagtatcca 
tactcgggag gctgaggtgg gaggatccca 
cgccactgca ctccagcctg ggcaacacag 
taaaagccag agccaatctg gtgtgtgcca 
gtgctcatag agatgctctg ggatcctcac 
tccaagctgg tgagtatagg agaactgata 
tgggattcca gtccaacaac atcgctatgg 
ccctcaggcc cctctacctc accaccccat 
ctagagtgta ccagacttca gaggagcaca 
ccccattccc accgctacgc ccagccccaa 
ctcagcccag accctctttt tgctcttggg 
atcaggtgcg ttcagggtgg tatggccgta 



agaaagtgac ttgcccaagg tcacacagaa 21540 
agtttcctgc ctcttaatct aacactgttg 21600 
gcaattatgg gaggattctt agaggaatta 21660 
ggagaaatta cagtagaggc atcaattcag 21720 
atagtgatgg gctggtctgg cgagggaacc 21780 
ccaacccaac atcattgcat cagtgcatgc 21840 
ggcaccactc tgatagtgaa gcaagaaggg 21900 
tgactggggt caatctctct tcccttccag 21960 
ccttgtggga tgctggtctc aattatggtc 22020 
tgtgtgtgca tgagtgtagg tgtctcctca 22080 
accctctatg aaggtagaga atttcacagg 2214 0 
catgatctac ccaaggttac ccagctccac 22200 
agtcctgctg gctgcttggc attcaagagg 22260 
taacttatat agaccccaga gaaataaaag 22320 
ccatagtccc actgctttgg gaggccaagg 22380 
gatcagcctg ggaaacacag caagaccccg 22440 
ggtatggtgg catgtgcctg tagtcctggc 22500 
ggaggtcaag gctgcagtga gccatgatgg 22560 
caagacccct ctcaatacct aaataaataa 22620 
ggcccaggca gacaatgatg tgatggacct 22680 
ttatctgctc ttcctcggaa gctgctcact 22740 
ccatgtttat gtcttccttt ctagggccag 22800 
ccaagggcat gttcacttcc attggtatgg 22860 
cccagagcag gactagccca ggactgcagt 22920 
gcaggcacta gtacagctcg ggactcacct 22980 
atggcttgtc ccaccctaat tccagctccc 23040 
agttgtttcc cagctgtata ttcagacgag 23100 
gacccagctg tatattcaga ctctggagtt 23160 
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gccaaatgct ggagtaacag gacagtggca tccagggaga tgggagtacc ttcaggaccc 23220 
aagtgggttt ttagggcttc cctgctgggc tacagggaag agagagtgag tcagagatgc 23280 
ccctgtgggg gatctgaggc ctgacaggga tgagttgcct ctgcttagcc aacaaacatc 23340 
taaccagttg tggggctgcc agagtcaaca gccaggggac agccagacag ccagtccctc 23400 
cagccactca tcagtaggta caggaagcct gggcttgggc cctctcccag cttaatgcca 23460 
ccagcatctc tgtgtctccc agaacctggt tactataagg atggagaatt tgggatccgt 23520 
ctcgaagatg tggctctcgt ggtagaagca aagaccaagg taaactgcca ccaggatggg 23580 
ctggaggtgt gggcagcatg tcagtgactt tgggctgcat gggaggaaag tggggaggaa 23640 
gatcaaagga gaagggcatt tagtgtgcag gcatgaattt atctggagtc ttccttagga 23700 
attccataag tcagactttt caaactcgct tgcacatttg aatcacccag agagctttct 23760 
ggtgctttaa atgcccatgc ccaggcccca ttccagacca agtgaatcag tatccctgcc 23820 
ctatatcatt ggttctcaat gcacataata atcgccaggg gagcttgtta gaaataaaga 23880 
ttctgtataa tagcagccaa caagataaaa tacctagatg taaacttaaa aaggcttaca 23940 
tggagaaact ctctactaag ggacataaaa aagatttaag taaatggtgc tctctggaga 24 000 
gaagcctcaa tattatgaag atatcattac tctctaaatt aatctataaa ttcaagataa 24060 
cctcaattcc agtgctaact ggatactttt gtaacttcac aaaatgatag taaagtttat 24120 
ttgtaaaaat aaacatgcga agatagctag gaaaattctg aaaaagaaga gcagggccag 24180 
gtgcagtgcc tcatgcctgt aatcccagca ctctgggagg ccaaggcagg aggatcgctt 24240 
gagcccagga gtttgagagc agcctgggca acatggcaag accccatctc aacaaaaaat 24300 
acaaaaaaaa attagctgca catggtgtca tgcacctgca gtcccagcta ctcgggaggc 24360 
tgaagtggga ggatggtcta agcctaggga ggttgaggct gcagtgagct gtgaccacgc 24 420 
cactgcactc tagcctcggt aacagagcaa aacccaatct aaaaaaaaag caataagaaa 24480 
agccagtcct ctcaaatttg aacacaaact accataacta aaacagtatg acactggaga 24 54 0 
tgaaatagat agctgtaaca atgaaacaga acaatcagtc cagaagtaaa ccctcttatt 24600 
cccataaaca acatgggaat ttagtttaca agaaagctga ttttgcaaaa aagtaagaaa 24660 
aaaatgatta ttcagcaaat ggacaggggg aacaaatggc taatcatttg gggggattaa 24720 
actgaataac ctccaaaata aaattccaga tggaccaaag ttttaaggtg agcaatgaga 24780 
ccacaaaaga tgtagaggaa aacctatatc aattcattta taatctcagg gtggaaaaga 24840 
gtctttctgt gcaagaccag aagccatgaa ggaaaagatt aataactctg actaaataga 24 900 
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aaatggaaac aatgtttatg gaaacaatgc cataaacaaa gccaagagat gtgtgacaaa 24 960 
tatctcttgg ctttgtttat ggcattgtga gagggtgagt tttcttattt taccatgagc 25020 
tttttacaaa tccgtaagac aaaaatagac aaaacaggaa aaaatagggc aaaggacatg 25080 
aaaaggcaat tcactaaaat taacaccaaa tggtccacat gcatgtggaa agatgctcaa 25140 
gttcatcctt aaagaaatac aagaccgggc gtggtggctt atgcctgtaa tcccagcact 25200 
ttggaaggcc gaggctggtg gatcacctga ggtcaggagt tcgagaccag cctgaccaac 25260 
acggagaaac cccatatcta ctaaaaatac aaaattagcc gggtgtggtg gtgcatgcct 25320 
gtaatcccag ctactcatgg ggctgaggca ggagaatcac ttgaacctgg gaggcggagg 25380 
ttgcggtgag ccaagatcat gccattgtac tccaacctgg gcaacaagag caaaactctg 25440 
tctcaaaaaa aaaaaaaaaa tacaaatcag ggccaggcat ggtggctcac acctgtaatc 25500 
ccaacacttt gggaggcaga agtcagcgga tcacttgagg tcaggagttc aagaccagcc 25560 
tggccaacat ggtgaagccc catttctact aaaaatacaa aaattagcca ggcgtggtag 25620 
tgtgtgcctg taatctcagc tactcaggag gctgaggcaa gagaatcgct tgaacccggg 25680 
aggtggaggt tgcggtgagc caagatcaca ccactgcact ccagcctggg cgacagagcg 25740 
agaatctgtc tctaaaaaaa gaagaaggaa aagaaagaaa agagaaggga aaaaaaagtc 25800 
tctttcttgc tcagtttcct tcctgagaac atttcaggca cacacagcgt atgtatctat 25860 
ctatgtatat attttgaaaa tacaaacaaa atggattcta ctgtacatac ttttctgcac 25920 
cttccttccc tgccccgctg ccccttactt aatagtaaac ctaggagagt ggtctgcaat 25980 
agcgcaggga gatctacctc attcttttga agggttgtcg aagagcctat agtatgtgga 26040 
tgtcccatga tttatttaac caaccgtgga tattgaggct gtttccaaat ttttactata 26100 
tttgccatca atgctacaaa tcactgtata aacatcttgg tgagcttttg ggactatatc 26160 
cagaaggtac atatctggcg tagggatttc caagacaaaa agtacaagca tttttaatag 26220 
gtaatttgat agatattgtc aaactgccct ccagaaaggt tgcaccaatt gtactttctc 26280 
caacagtgtg tgagagtigcc catttcctca catactcact agcactgaga atcatcgaag 2634 0 
tcactaattt ttccaatctg ataggtgaaa catggtattc tattgttgtt ttgacttgta 264 00 
tttctttttt tctttctttc tttctttttt tttttttttt ttttttgatg gagtctccct 26460 
ctttctccca ggctggagtg cagtggcgag atcttggctc actgcaacct ccgcctccca 26 520 
ggttcaagtg attctcttgc cttagtctct tgagtagctg gaattacagg cacacaccac 26 580 
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cacgactggc taatttttgt atttttagta gaaacagggt ttcaccgtgt tggccaggct 26640 
ggtcttgaac tcctgacctc aggtgatgca cctgccttgg cctcccaaag ttctggatta 26700 
caggcgtgag ccaccacacc cagccttgaa tttctttaag gatgaacttg agcatctttc 26760 
cacatgtatg tgaaccattt ggtgttaaat tttagtgaat tgctttttca tgtcctttgc 26820 
cttatttttt cctgttttgc ctatttttgt cttatggatt tgtaaaagct catggtaaat 26 880 
atgaaaattc accctctctc tgtgatatgt. gctatatttg tcacacaact cttggctttg 26940 
tttatggcat tgtttccata aacattgttt ccattttctg tttagtttag tcagagttat 27000 
taatcttttc cttcatagct tctggtcttg cacagaaaga ctcttttcca ccctgagatt 27060 
ataaatgaat tgatataggt tttcctctac atcttttgtg gtttcagcct cccgagtagc 27120 
tgggactaca ggcgcccacc accatgccca actaattttt gtgtttttag tagagagtag 27180 
agacggggtt tcaccatgtt ggtaatgctg gtcttgaact cctgacctca aatgatccac 27240 
ccacctcagc ctcccaaagt gctgggatta caggcgtaag ccatccattg cacctggcct 27300 
cttttttttt ttttagagac agggtctcac tctgttgctg aggctggagt gcagtggcac 27360 
gatcacagcc cgctgcagct tcaacctcct gggctcaggt gatgctccca cccttccacc 27420 
tcagactccc aaataactag gactatagat ggacaccacc atgctcagct aatttttgtg 27480 
tttttttgta gagactggtg tattagtctg ttctcacatt gctataaaga tactacccaa 2754 0 
gactgggtaa tttttaaagg aaagaggttt aattggctca cagttcccac atgcctgggg 27600 
aggccttggg aaacttataa tcatggcaga aggcgaaggg gaagcaagga tctcgacatg 27660 
gtggcaggaa agagagagct aacaagagca gggaaaactg tcttataaaa ccatcagatc 27720 
agccaggcgt ggtgggtcat gcctgtaatc ccagcacttt gggaagctga ggtgggtggt 27780 
tcacctgagc ttaggagttc gagaccagcc tagccaacat ggtgaaacct catctctaca 27840 
aaaaatacaa aaagtatctg ggtgtggtgg tgcccatcaa taatcccagc tacttggaag 27900 
gctgaggtag aaggatcacc tgagcccagg aggtagaggc tgcagagggc cgtcatcgtg 27960 
tcactgctct ccagcttggg taacagaggg agatcctatc tcaaaaaaaa taaaataaaa 28020 
taattgaaat ttagatttct gggccctgcc ctaaggattc tgcctccagt aggtttggga 28080 
ggtgggccct ggaatctgca tatttatcat gctccctaag tgctgttgat gtgcgtactt 28140 
tgaaaaacac aaacctaagt ctaaatttcc agaatcagga gggaagcacc aaaggtggag 28200 
cactaagtgg cagacaaggg ggctggccaa gagaagggac tctagaaggc tcaggcctag 28260 
ccagggtgtc cagggttgat gaggcccagc tccttgtgtc ctccgggtgg agtgctcctt 28320 
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ccttcccttc agcccagttc ctcctcctcc ctcactgtcc tgcttacccg gcttctattc 28380 
tgggctccca gtacccaggg agctacctga cctttgaagt ggtatcattt gtgccctatg 28440 
accggaacct catcgatgtc agcctgctgt ctcccgagca tgtgagtgcc cctcagcatt 28500 
gccttctccc tgaccctggg cctttcctgc ctctgctacc tgccaccaca tcctctgtcc 28560 
ctgcccctcc tccaggaggg tccacactgg tggcacctgc agacacacac tggggcattc 28620 
ctccccagct catcagagac cccagagctt ctagaacttt ccagtcagac cagcctgccc 28680 
aaccccaggt agtgaataac tggaggaatc tgagattggg cctctgagct cggccactac 2874 0 
aaggcctgga actataagtg atcccttcta ttcttttgcc tcagtctcct cttttgtaag 28800 
ctgaggataa gaatgcctgc ccagcttact cccattcccg ggcctgaaat ctacccaccc 28860 
caccgcccat gaacatcacc atgacatgaa acagccagcc aggggaactg cctctgaaaa 28 920 
gccccagaga attcctgaag cctgatgatg gtgggggacc tgtggccatc tggactatgg 28 980 
tgacagctgg agtaccacaa caggggactg ggcgtcacca agacttcacc tcttggcagc 29040 
ttggcttaga gaggctgtca ccccttctat ccttcgcagc tccagtacct gaatcgctac 29100 
taccagacca tccgggagaa ggtgggtcca gagctgcaga ggcgccagct actagaggag 29160 
ttcgagtggc ttcaacagca cacagagccc ctggccgcca gggccccaga caccgcctcc 29220 
tgggcctctg tgttagtggt ctccaccctt gccatccttg gctggagtgt ctagaggctc 29280 
cagactctcc tgttaaccct ccatctagat ggggggctcc cttgcttagc tcccctcacc 29340 
ctgcactgaa cataccccaa gagcccctgc tggcccattg cctagaaacc tttgcattca 29400 
tcctccttct ccaagaccta tggagaaggt cccaggcccc aggaacacag ggcttcttgg 29460 
ccccagatgg cacctccctg caccccgggg ttgtatacca caccctgggc ccctaatccc 29520 
aggccccgag ataggaaagc cagctagtct cttctcttct gtgatctcag taggcctaac 29580 
ctataaccta gcacagactg ctacagctgc tcccctcccg ccaaacaaaa ccccaagaga 29640 
gcaatgcccc taccacccaa gggtgccatg gtcccgggag agcccaaacc tatcaccacc 29700 
tgttgggcat agccagagct gttcccaccc agccagggca tgaaacatca accccccaca 29760 
tgtgaaccca tcattcctaa accctgggta ggctccatgc caagtaacag cagagggagt 29820 
taagccatag gaatttggct gtggagtaag agggaatgcg gtgaggcact ctggaatatg 29880 
accctaccag aggttggaga acaaacttgg gcagccggaa cccgtcacta ttctagactt 29940 
ccctggcatt cgaggagccc tttgaacttt ccaaagtgca gccacagcta caatgctgtt 30000 
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aaatcctccc acattcttgg atgccccttc accttgtgtg gacagtgtct ggtttctcca 30060 
ttzttacagac aggaaaactg agcttcagac agggggtggg ctttgcctaa ggacacacaa 30120 
atttggttgg gagttgatgg ggccagatga gccagcattc cagctctttc acccttcagc 30180 
aacatgcaga gtccctgagc ccacctccca gccctctcct cattctctga acccactgtg 3024 0 
gtgagaagaa tttgctccgg ccaaattggc cgttagccac ctgggtccac atcctgctaa 30300 
gacgtttaaa acagcctaac aaagacactt gcctctgggt tttgcattgt gtctgctgtg 30360 
ttgccggaga ctgctgtcac ctgtggcttt tgtggtgggg gaggaagaag agagggtgag 30420 
gtgggggtgg ggacatgagg ctcctagaag ctcagtgggg gagcacgtgg gcttgagagt 30480 
atgtgtgtga gcatgcatat gaatatgtgt gcgcatgtgc acagatgcac gtgcacacca 30540 
cgcccaacca gccaacctcc agcctgtcta aaaaggtatt gtaaccttgt ggaagggaca 30600 
gaaaggagcc ctacactggg gaacttgtca aatgtaatcc agaaaagttg gggctggcac 30660 
ctttgccaag cagcgcacag ttggtggaag gaaagaacca aggtgcaggg tgccaggcaa 30720 
aggcaccagg cagagaagtc tcctggcctg aaaccatcag gcctagtctg ctgagtgtga 30780 
gggtgacctg tgatggttac catcaccagt gcttgcatcc agaggctcca gccctgacca 30840 
agccttactg agtatccaaa gatgggcctc tgtttgctaa ataatttgct ccctcacctg 30900 
ctcaccccac aaagacaggt actactaata aaattaaagt catcatcacc ctcatcatca 30960 
tcaccatcat catcatcatg gctgccattt actgagtgtt tatgtgccac cgtgcaagaa 31020 
gctttacatg cattttctca tttaatcctc acaacctctg agaaggccct tttcttattc 31080 
ccatcttaca agtgcaggaa ctcagtgagg cccatagaga tttatcaagt gactcaaagt 31140 
cacttttttc attgggtaaa tgaaaaaaag ccagatttaa accagtcaga ggactctaga 31200 
atccactctc aagaaagggc aagtcctggc cgggtgtggt ggctcacccc tgtaatcaca 31260 
gcactttggg aggccgaggt gggaggatca catgaggcca ggatttcaag agcagcctgg 31320 
gcaacatggt gaaaccccat ctctactgaa aatacaaaaa ttagctgggt gtggtggcgc 31380 
acacctatag tcccagctac tcgggaggct gatgcaggag aatcgtttga acccaggagg 3144 0 
cggaggttgc agtgagctga gatcgtgcca ctgcactcca gcctgggcaa cagagcgaga 31500 
ctccatctca aaaaataata ataataataa aataaagaaa aataaagggc aagtggtatg 3156 0 
gctttcatca ttttactgac aagaaaactg aggctcaagc aggagaagca gatccacacc 31620 
atagtgcctc cccagaccct gccagacctt tgcctttgca ccctgacact tgctcctggg 31680 
gttctggcag gtcatgggtg gcaagaccag gcacctgacc aacaccctca cacaggcctt 31740 
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tcggaccgag gcagctcgtg cttttttcct cagtgcgaga cccaggactc cacctcatct 31800 
ctgaatcccc ttaactgccc cttccagcct accattctgg gaggaagaac ccagccctgg 31860 
aacgtggctg gaacgagggc tcacagtcac cccagctctt ccctcccctg ggacccagat 31920 
ggccagactt gaccattcct gctttaaggg attaactzctg ggtcaggcag ggaaacaggg 31980 
cacatacacc agggctgata cacagggact cttgtgacat cgttaggaat gaaagattgt 32040 
taggaactgt ccccttgggc tctagtactt tgttttaatg ttggaggatg gggcttagca 32100 
aacctctcct cgcctgctga ttgcctgggg agccggccat gtggaaaagc cccttttcct 32160 
aggagttaag attctggggt ttgagtccca tttgtgccag ctgtgtggcc ttgggcaagt 32220 
cacttctccc tggtcttcag ttatcctttc tggaatctcc catgctgcct tcttggtgag 32280 
gttacagcga gaggctcatg agacccggat gtggcagcac tctaaaccag gaagccacac 32340 
agacaagaaa tgatattgct gttatttgcc cagcacaagt tcaattcagc ttcacctttg 32400 
caggtaggta agaaaaaggt ctggaaccca agttttatat ataaatagcc aaaattagaa 32460 
aggctctaga gcaaacatct aatttacgtc tttccaacta tgctcccggg agcctgagat 32520 
gcctcagaaa caccacaggg gctgtcgagg aaaacagagc tgctgcacga agtagccctg 32580 
acgggcttcc cgtaaaattg tacttgaagt agttgcaggg ccagcttcac gggtgtagga 3264 0 
cctgtgcagt ccctcaagac tttgtgctta caagggcccc atgcttggtt taatgctcgc 32700 
ctgtcactgt ctcaaaattc ttgataattt ttaagcaaga ggcaccgggc attttcattt 32760 
tgcacaaggc ccgcaaaatt caagagccaa tcctgaatag gagctccatg accaaacaca 32820 
cacacacaca cacacacaca cacacacaca cacacacaca cacaaaacct ggtgcagtcc 32880 
tttcagtgta cagatgggaa aacttcaaca gcaacccagg aagaggaagg gacttgtcca 32940 
aggtcacaca gtccagtggg gccagagcaa ggactaaatc cgcagcatac tgactcagct 33000 
ccacaatggc tccgctctcg ctttcagtgt gtgcttctgt cactccctta ctgcctctaa 33 060 
tgacccactg aggcccaagg agacctgttg aaggtcacac agctgtgaca acaatggtga 33120 
cggcggtgtt gagccacgtt ccttcaagct tcaaatctga atagaaaagg gcataggtaa 33180 
gatgataggt agataggtac atacacaaat catctaaata gctccttttg ttttttgttt 33240 
ttttttttag agagagggtc ttgctctgtc acccaggctg gagagcaatg gcaccatcat 33300 
agctcaccgt aaccttgaaa tcctgggctc aagtgcttct cccgcctcag cctcctgatt 33360 
agctaggact acaggcatgc accgccatgc ccgattaatt ttttttttaa tttttatttt 33420 
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atagagatga tgtctatgtt gcctaggctg gtctcaaatt tccggcctca agcaatcctc 33480 
ctgccttggc ctcccaaagt gctgggatta caggtgtgag acaatagttc attttttaaa 33540 
atgataacta actcagcctt tgctggtttg ccagtcagct ggttccagct gtgtgtgagt 33600 
tgggacctca gaaggggctg ggctggctac tcttcagcca gctaaaccac gtcagcagtg 33660 
aattttacca actcactggg tgcttcctca aatcagtggt tgggcactct tggcaccaaa 33720 
aaatatatat atatagaagt gcgaaagatt cagtagcaac cgtttcctcc ctgtccgtcc 33780 
ctgctcctca tggctcgtca cagccaggag ggggaggtgc tccccaaacc ccaagccagc 33840 
tagcctttgc ccagaggccg gcccatctct tcacttcctt gtttctttgt gaaacaggaa 33900 
ctgagcaagc agcttaagtc gggtaggatg ggtggttgag aaagccccag tggtccgtgg 33 960 
caagtccacg aagctgaaaa gctggtttct ctcccagcgg gtcccaggca gcttcaggcc 34 020 
ctcttgtttg tcacacctct tgtgttctag aacaacttgg ggtgagggat gcactcttga 34080 
caaggtggag acataggata tgggccacag ggtcacgtag cagagattca gatgcggacc 3414 0 
tcaagagaca tagaagcagg tgccaggccg ggcatggtgg ctcacgccta taatcccagc 34200 
tctttaggag gccaaggtgg gtggatcacc tgaggtcagg ggttcgagac cagcctggtt 34260 
tggcaacatg gtcggcaaca tggtgaaacc ctgtctctac taaaaataca aaaaattagc 34320 
tgggtgtggt ggcaggcgcc tgtaatccca gctactcggg aggctgagac aggagaatct 34380 
gggaggcgaa ggctgcagtg agccgagatc ttgccactgc actccagcct gggccacaga 3444 0 
gtgagactct gtctcaaaaa aaaaaaaaag aagaagaaga agattccaag gggacactaa 34500 
gggagaaagc ggcagctcct aacggtcacc aaggccttcc tggatttagc aaggggagaa 34 560 
actccaggtg gagtctgaag tgaactgatg tggagtcact caccaggtac ccagcacttt 34 620 
ccaccagctg gggcactggg ctgagagcac tccctctatg catgtcacct gagtagtaac 34680 
cctagtcaca cacacccaca agaaaggccc acattatcac tgttaccatc ttcactaagg 34740 
ctcattgcag gtgaggtcac ttgcccaagg ttatgcagcc agtaggtgac catacaacaa 34800 
tctgaatcca atcaggacaa cctgactcca tttctgagag gtgttctgtt cctcctcttc 34860 
cctggagccc tggacaattg tccccatggg gagaggggcc cactgatcag agaggtttgt 34 920 
ccttgtaacc ttgggacaga gatttatgca gagtagccca gcccctactt gtcccccata 34980 
gaaagtgcac ccccccacca gtgagtcatg agtgtgtttt aatctgagca ccaaagtcca 35040 
gcctcctctt ctgatggtaa ttgacagagc taccagcagg tgttttttgt tttgttttgt 35100 
tttaaaacca agaccttatc acgagttcag agttccaaag atcactgtct ttccctcccc 35160 
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tcccatcagc ccctgccacc agatgcccac tctgtgtgtg ttgtatgtgg gcatgaggaa 35220 
tgggcgcaag agtcacctgc ccttgtctcc ttgctttcct ctgcccgcct tctccctcct 35280 
gcaactttgc tctttctgcc ctccctgcct aaaatgacca ccattaaatt tgctaatggg 35340 
ccgggctact aaagtgggct tcaccttgga aatgtccagg tattttcaag aaactatgtg 35400 
gcctctgctt ggatatttct ttttcttttc tttttttctt tctttttttt tttttggttg 35460 
ttttgttttg ttttgttttg ttttgtttga gacagagttt cactcttgtt gcccagcctg 35520 
gagtgcagtg gcgtgatctt ggctcacagc aacctccacc tctcaggttc aagcgattct 35580 
cctgcctcag cctcccaagt agctgggatt acaggcatgt gccaccatgc ccgcctaatt 35640 
tttgtatttt tagtagagac gggtttcacc atgttagtca ggcgggtctc aaactcctga 35700 
cctccggtga tccacccacc tgggcttccc aaagtgctga gattacaggt gtcagccacc 35760 
gtgcccggcc tacttgaata tttcaagaga tggggagatc attacttacc agagctgaac 35820 
tagtgccatc tgaatagcca aaatttgttc cttggaatat actaaaacct tccatttatc 35880 
cgttccccat cattcatacc acttgcttcc ccaccccaat cacctaggct aaatgtgaca 35940 
ggagtttagt acttcacacg ttgacgactt ccctatactt cacccccgct taatcgaccc 36 000 
tagctattgc agctttgcag atgtgatgcg acttccagat gtcctcccca tcctgttcac 36060 
tcccctccgg gcctgcttca aggtggctct gtggcaccta gtgtggcacc caaaccagca 36120 
tcaggtgcca agagaaggct gcaagccgag ggctccataa ttctctcacc tccccgttca 36180 
ggaactccgc atcagaggca cttcagctga gtggttagtt gtgtgggccc tggccaagac 36240 
cagacctgca tttaacgctg ggctccactg cttactagct gtgtgggcaa ttcacttaac 36300 
ccctctgggc ctcagcttcc tccctggaaa atggggataa caatagtgcc tgacttacca 36360 
ggttgttgta aagcccactg cctgccatga agtataataa ttttcagcta gatgtgggga 36420 
ctggggggaa tggaaataag aatctttggg ggccgggtgc agtggcttac acctgtaatt 36480 
gcagcacttt gggaggtaga ggcaggagga tcacttgagg ccaggagttc aagatcagcc 36540 
cggacaacat aggcaaaaaa aaaaaaaaaa aattagccag gaatggtggc acgtgcctgt 36600 
agccccaact acccgggagg ctgaggtggg aggatcatga gccccaggag tttgaagctg 36660 
cagcgagctg tgaccacacc actgcattcc agcctgggca acagagggag accctgtctc 36720 
aaattaatta attaattaat taattaatta attaaaactt aaagaatctt ttgggaggtt 36780 
tttccagata taagtatctc tcactatcca aactcttact atctcaaatt agaaatatat 36840 
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ccattcagag agtgagagat gcttttgtac atggtctgga tcttacctgc cccaaatatc 36 900 
ccactcatga tcatctacta aataagaatt tcttggctga gtgcagtggc tcacacctgt 36 960 
aatcccagca ctttgggagg ccaaggtggg cagatcacga ggtcaggaga tcaagaccat 37020 
cctggctaac atggtgaaac cccgtctcta ctaaaaacac aaaaaataag ccaggcatgg 37 080 
tggcatgcgc ctgtagtccc agctacccag gaggctgagg caggagaata gcttgaaccc 37140 
aggaggtgaa ggttgcagtg agccgagatc gcgtcactgc actccagcct aggcaacaga 37200 
gtaagactca aaaaaaaatt aaattaaatt aaatttaaaa aaatacttgg agtgtgctgg 37260 
tgtgggagct gctttatctg aatgtggcct acgatgacct ttgtgtcttt ggtagtccta 37320 
catccaccct ctgagggttc aggttgattt cactatcagt ggaaccccaa aacacacaca 37380 
cacacacaca cacatacaca ccctgcaatc ctgcagtatt ggcgctaaat caaggttctt 3744 0 
ctgccccaaa tctatacttc tggatacgtc tttatattaa actactcgcc tccaggggtt 37500 
gtggtgggat taaacatctg tcaagtgctt aatgtagtgc ttggtaaaga acaagtgctg 37560 
ggtaaatgta agcttgccac tccacctagg atgtcagctc ccaggtacaa ggggccctga 37620 
tggctattta ttttgtacct cctcccgagg cctagcacaa ggctgtgccc acagcaagtt 37680 
cctaggttct gcctgccggc ccattatcaa gtttaaggtg accttgttcc ctgaactact 37740 
gactcgtatc acttccttag ctgagagctc tgcgtacctc aaacaacata acctgtaaaa 37 800 
tgcaaatatg ttacccagaa gggaaaatct attaaacaca tgtttaaggg tcattagcca 37860 
gcaaggacct ggcctattgg gattcagcca gggcaatctg ttggaagaga aaagctgggg 37920 
gaaggcaagt cccgatgggg ggtcctggac accccgcctc cactggcccc ggtccccagg 37980 
tggcggggag gaggtgttcc tgaacagagg agacagatct gaaaaggaga acgaaggaag 38 040 
ccgttgaaaa cacttgctcc cttctgtcct gggagctggt cacctctcca tatttctctc 38100 
caaggcaccc tcacctccct ctcgctgcct tttcccaagc ctcgccagcc ctgtgccctg 38160 
gtttgctggc cgcctttgaa ggttctgcgg ttagtgtgtg atcttctgtc gctgccggcg 38220 
ctgagcaaac aggaagcggg cacatcctgc gaggcggcca gggacgcagc cgctgcgcag 38280 
gcccggaaga ttctgcgacc tgcttcctcc ctcacggctc tggctccagg tgccccaggc 38 340 
cactacagcg gcgtccccaa tcccagaacc ctctggggct gccctcctgc tcctggtcac 38400 
gcagagcccc aaaggcccga tgggtcccct acacggctgc caggctactg tatatgccga 38460 
aaagccttaa ccagttaccg cgcttgctag tgaccctggg ccgcctggga gggacttztca 38520 
gggtggaaaa cacaggccag tgggaaggag atctggggtg tggaatcttt tcatgagtct 38580 
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gcctggccga ctgagtaggc caaagaatgt atggtttcag cttagggaag aaagatctta 38640 
tcttccctga gctcaccctg aaggttggat ggtcagaggg cataaagcag aatcggagag 38700 
gaagtcacga gaacatcttg ggaacagggg tggacaagcc tagtatttca caagtaccac 38760 
agaattgaaa gccttaaaaa tgtgcatacc tttgacccca gcaattcctc ttctggatat 38820 
ttcccctaca gaaaaaaaac tatggatgtg aatatttacc ataagtatgt tcaatgaata 38880 
atttaaaaat ttaagccaac ccaagttcat cagcaagggc ttggtgaaat aagcacggca 38 940 
catccacaca atgggatgtt ccacagccgt tataaattac gtagttgaag actatttcat 39000 
agcatgagac aatcttgata ctgtgttcag ctggacgtag tggctcacgc ctgtaatccc 39060 
agcactttgg gaggatgagg caaagaggtc acttgagacc aggagttcaa gaccagcctg 39120 
ggcaacatgg tgaaaccctg tctctacaaa aaatacaaaa attaaccagg gtggtggtac 39180 
gcacctatag tcccagctac ttgggaggca gaggcacaag aattgcttga actcaggagg 39240 
tggaggttgc agtgagccga gatcacgcca ctgcactcca gcctgggaga cagagtgaga 39300 
gactccatct cgaaaaaaaa acaaaaaaga tactgtgttt agtgtgaaaa gcagacacca 39360 
atagtgtgaa ttatgacctt ttttaattaa aaaattttaa acagctttat tgaggtataa 39420 
tggagctaca ataaactgca catatttgaa gtatacaatt tgatacgttt tgacatgtgt 39480 
attcccccat gaaaccatca ccacagtcaa gatagtgaac atattcatca ctcctaaaag 39540 
cttcctgtta ccctgtgtga tgtctccctt cccttctctc ccctcccctt ctccccaaac 39600 
caattatctg ttttctgagc agtatgattt ttgtaaaaag aaaaaaaatg tttatagaca 39660 
gagagtagat agatgcttaa aacagtgtag aggacaatga tgggaaattt tgaaattttt 39720 
tttgtttatg tgtttgtgaa attgtatgac aaatacaagt tgcttttgta ataccaagaa 39780 
gttattaaag ttattaaaat ctgccacaga caacccacaa attgaatata ctgcataggt 39840 
gtttgataaa tgcctgttgg atacttgcta gaggtgtgga gttggctgta agtgcttttc 39900 
agctgtggtg gtgataaata tactcatgat ggctgagggg tactgactgg aggggtgact 39960 
gctctagtga tgatggcttt aggtgtactg atggctgtag gagtgatggc tgcagggctg 4 0020 
atggctgcac ggatgtgatg atggttgtag gagagatgac tgtagggacg atggttatag 4 0080 
gggtgagagg gtcatgagtg cggttggctg taatgtccag aagggtgaca gctgtaagga 40140 
tgatggctgg aagggtgata gcagcaatgg tgatgactgt agaggtgatg gctgaaggat 4 0200 
gtcaactgta gaggtgatta ccatagtgac aatggctata ggtgtgctga tggcagcggg 40260 
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gatggctgca ggcctgatgg ctgtaggggt gatggctgta tggtgatggc tattgatagg 40320 
agtgatggta gcagtggtga tggtcacagg ggtgatggta gtggtagtga tggcagtgac 40380 
gaccttgaag acgtgagttg accgggagca ttgatccaac aggcagccag atggtccaac 4 044 0 
ttctggtttc tggattctgg tcagtgagat aaagggcttg ggtacgccat ataatgtcga 40500 
gctgattgtc cacctctagg aagatggagc agaaggactg atgatgatcg acatgagaca 40560 
acaaggtagg gactttctct gactgttcaa cccaagagcc gtctgcatca gaatctctgg 40620 
agactgaagc ttggacttcc ttcctcccta ggggccggca ggggctgcga ttgctgtaag 4068 0 
agtaggtcac gtggcagggc ttcctgtatg ctgctgctgc cgggtgtcca tggcccgcac 40740 
ccccaagctg ccactgcagc agtcagagtg gcagctgaag gctcggttca tgccgtgccc 40800 
ccgggcagtt ctggtgaggc taagcaagag gcctctgcat cttgacacct aggagagcag 40860 
ggacggagtc tcccagggtg gaggaccatg ctgcgccgca agccctccaa tgccagtgag 40920 
aaggagccca ctcagaagaa aaaggtgagg agcattttgg gaactcacat cttcctttcc 40980 
tctgcttgct cgacccatcc cttccaagac tcttggaagt gggaagagcc aaaggccata 41040 
tgttgagtct gtaggtgagg gccactcacc tagaaggcaa cgggacaggg acccaaaggt 41100 
taacagaagt acagaagaga tgtttaaaat tctggagcaa cagagttaca gaatagatcc 41160 
ttagaaatct agagtcatgc aattcaaccc tagaagaatg ggaaaaaaga tctaaaatta 41220 
ggggccagaa tcttagacta gaatttgaga ataacaaaat acaatgctta agctgaaccc 41280 
taagcttgta cttaactatc tctcctctac gctggtcctg ctgccacttc tgccccggcc 41340 
acggcctctc aggagccatc tcactcaaag ggccctgctg ggctgcccag cttggagcca 41400 
tgatgctcac aggactgcca ccccctcctc cactttggcc atggcccaga ccctgcccac 41460 
ttggctcagc tctgtggtag ctcccaggcc ctacccgaga tggagccaac ctggagatgg 41520 
gggcctgagg ccattccagt cttgggggaa aaggcatccg gaagtgcagt gggatgcagg 41580 
gggcatacag gttcttcctg ggtggggaat gagaaattaa gttgaggtgg tgagtaggag 41640 
tgaaggctga ggagggagga ttttaaaggc caggaggaaa aacagctgtc atggatggct 41700 
gaggtgttca tccccctggg gcagagaccg gactggaaga atccttgaag gctgtgccag 41760 
atctaggatt ttgtggttga atttgaggat ttccagattt ctttggttgg gtttgaggat 41820 
ttctagacct aattctttaa ttcaattgaa ggattctccc agacaatcag caagtatatc 41880 
ctacacatcc aatgggatgc taggacagtc ccaaagcagg cctgaactag gtccaagagg 41940 
cccaggctat cccccgccaa ttttccctgg aactcagccc aggtgcccta tggtggatcc 42000 
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cccttgtcag gggagcagct cagagatgga ggccaaggca gctgaaagac tgagactcgg 42 060 
ggcagtgggg tgtgcatcct gccccagctc tcaggtcctg gggtggctgc cacttccggc 42120 
tgtattctcc ccaccccagc tgcttctcta aggttgcagt ggtggtgggc gggggagggg 42180 
ggaggttcat gaatcttcac acgggctctg tcatccccgt ccaaaccctg cagggtgcta 42240 
ggcccagcag gactgccatt ggctgccact cctaagagag cagggccact tccccttgct 42300 
ctctggctgg tgggggtggg aaacagagat tgctctatag acactgctag gcctcaaacc 42360 
acaggggcgc tgagctggag gaaagtgtga gaaagagctg acactttgac actgggaccc 42420 
tgtgccacat caggaagacg tacctagggg cacacagaag ccagaagtcc cctttttctt 42480 
tcccagcttc aattacccag attctgtgtt cggcaggcaa gtgggcccct gggatttccc 42540 
caggtctctc ccaggagaag gggtcctttc ggggagagta gtccaggcaa accccacccc 42600 
aacccctgcc agagccggtg ccagcctcca gagctgggat gagcttgcct ccacacactc 42660 
tgctagcccc agactatgga aatgtccccc agaagccaag cttccagaag gtacagagca 42720 
tcttccccag gcccagaggc agcgggctgg ctgaatcact gctagtccct tcttctcaga 42780 
ggctgagatg acgtggtgga gaggaaagca caaaggcaag agaggcctct ttctcctgcc 42840 
cctgcttctc ctcttgggcc tgccaatgga gaatgtgcca gacagggggc aagaaagctg 42900 
agctggaatc ctaccacgct ctagctgtag gaccttggac agctcattga ctttctttgg 42960 
gcctcagttt cctcgcctct gaaaaggagg ataactccta atctgccccc ctcaagaggt 43 020 
tgtacagatc acccaaaaga atagatgtga gagtaccctg tcacaagtgt gcacgtgcat 43080 
accacacaag ggggtgcctg ctgtaggcat gggtggggat agggagacac tgcaaagaag 43140 
aatgagccat gcccccagtg ctggcacaga gaacacaaga cttaatagta ctaagagcca 43200 
ccacagagtg agagctcatg ggatgccagg cacggtgctg tgcatctctt accttgttca 43260 
ctccccacga caacctggtg aggtaggggc agatattgtc attatcccta tttgacactt 43320 
gaggctcaca gaggtgaagc aacttgctga aggtcacaca gctgggaagc tgccgagcag 43380 
ggatccaagc caagatttgt ctgactccag aatcttgacc ccttctctct ccactctctt 43440 
gtccccttgt tgatctcaaa ggcgctagaa gagtctgatg agcataaaag attcaaccca 43500 
tcagcaggag agctccgtgg ggcagaccca gatagctgtg gaatgagtga ccttccagcc 43560 
agcagagcag aggagggtgc gggccctcta gaatggtagg gctaggacat ggcgcctccc 43 620 
atggatcgtg aaggattgga gaggccccag ggttgggaga cagacctctt agctgccctt 43680 
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cggagacttt cacttccttc aagttcttgc tctgcaagcc ctttgatctg atttcaccca 43740 
aaggccagtg ttggggtata ggaagtggag agggattttt cctggacttg gttcccctaa 43800 
ggatgctcct actctaccac atccccactt tctccatggc tcccttccta cagcacacct 43860 
ccagtatgac ccccacaccc tgccagcatg cactttgtat tttcagtggc tcctcactgc 43 920 
ccacaggata catgcccttg gcctggcggg caagagcctt ccatagcctt ttctactccc 43980 
ccacccacca tcccatcatc aggctctcaa cgtctctaac tttcattctt gttcaaaccc 44 040 
ttccgagctc cacctggaga aatcctgcct cccctgcatg gcccagctca gatatacctc 44100 
ctccacaaag ccccgccaga gcccttcccc ctctgcactc ccacctgctt tcctaggagc 44160 
tctgttcaaa aaccacctgc atgccatcac cactacactc aatccactgc taacttctac 44220 
actggtgggg cggggcaaca gcatgagctc tgaggtagca cgagcacagc aacatacttg 44280 
ggtttccaga gtcatagacc tgggtttgga ggcttgctct tcctaagacc agctgcgcca 44 34 0 
ccctgcctag ggcacttcat ttcgcatctt ggtctcctct gcttcgaggt gcgaatatga 44400 
tgcggggctg cgatgaggct ggagtgagct ggcacacagg aagtgcttgg caggacacgg 44460 
gcatgcggga ggagtttaga ccctcattgc tggattctca cacacacaga ggtagggcct 44520 
agagctcaca gggtgctcag tatatgcctg ataccctgaa ggagcagagg gggtagctca 44580 
ggagaccact ggggaaagcc aagtttgctc aggaagtaag ggaagaggat gctggctggg 44640 
gcagtgtgga cctctgagag attgatcagg gcatggtttc tgcatggatg ctgcctttta 44700 
agctggaatt gaaaaatctt tccatagaag tataatatcc atacagagaa gtggataaat 44760 
cataattgcg ccaccgaatg agtttctata atgtgaacat ccccatgtag acagcactta 44820 
gatcaagaaa caacactctc agaagtctag aagcttccct catgcccctg ccagttacta 44880 
cacccccccc ttcccccagg gcaaccgcta tcctgacttc caacagcaca ggtgagctct 44940 
gccaggcttt gaacttacag ggggaaccat atcgtgtata tatactcttt ggtggccagc 45000 
ccctttactc aaccatatgt ttgtgacact catccatact gttgggaaca gttgtactgt 45060 
ctcctacctg ggtactattc ttttgtgtga ctctgccaca attaatttat ccattctaca 45120 
gatgatgggc atttgcgggt tttccactta aattggaagg gtgagaaact acggctgaca 45180 
gagagaaggg cagagtattc caggtgggga aatgcccagg gtaagaggct ggaggtgggg 45240 
tgtaggaaag agtaagcata aacctacatt ccacaaggga ctgccgtgtc agacaatgtt 45300 
ccaagcactg taggcgtact cgctcatttc atcctcacaa cagccagatg aggtcggtgc 45360 
ttttgtcttc attccactaa tgaggaaaca gaaacctaga gaagttaaga aacataccca 45420 
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gtttacacaa tcagtggcag aaccaagatg 
cagagaggtt cgataacttg ccggaggtca 
tccctc 

<210> 7 

<211> 16595 

<212> DNA 

<213> Homo sapiens 
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atgcccctgc ctgtcactac atgccccctt 
aatagcatag gtgagttctg ccaggctttg 
tgtatatata ctctttggtg gccctacatt 
tatattgaaa tatatattgt tgagtatact 
caatacatat atacaataca atacatatat 
caaatgtaat atacaataca tatatactca 
atacatttga aatgtagctc tttccaagag 
aaagggggcc aacagtgtat ggtaccacct 
tcactccaaa gggaagccag cagaaaagcc 
gaatgggggt cacctctctc tgaaaggcgg 
gaggtaggct tggcaaaatg ccaccctgga 
tgtgtttgtc ctttccatct ccctgggctg 
ccatctctat agtcctttgt gattatctcc 
gaccctcgca caagggtctg ggagccaaag 
taatgacagg aaaaggcctt gggaaagagc 
tactcttccc caacaaagcc gacttccggc 
cctcaccagt ccccagggtt tcaggggcga 
gagggttggt ggagagaagg ccacaatagg 
tgccctcctt gggcccaggc cagcccgatg 
gcaataagag gaggaagtta tccaggcagc 
aatggaaggt ggaggagaga aggggaaaga 
ggggacaacc aagaaaggaa gatggaggag 
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ggaacacttg ccctgatgtg aagtgggctt 45480 
caaatccaga gttgaacttc tgagaaagct 45540 

45546 



tctcccacag taaccactat cctgacttac 60 
aactataatt atacaggggg aaccatatgg 12 0 
tcaaatgtat tgtatattac atttgaaata 18 0 
caatgtatat ttcaaatgta atatacaata 24 0 
actcaacaat atatatttca atatatattt 300 
acaatatatt tagtatattt caaatgtaat 360 
gcccacacat tcccctgagc catgaagtga 42 0 
cccccgtcag gcacgactcc caggaagtcc 48 0 
agccaggctc aagaatttca actcaaccct 54 0 
tcaagatact tggggctgtc cctgaggttg 600 
gggccctgaa actcgatcac ccaaagaaca 660 
agagtagcca actgggccca agacccagca 72 0 
tcccactttg gactaaaact gagacagagg 78 0 
gcctttcctc ccagccccca gactgcagat 84 0 
tgcaattaga gggcaggcag gcagtgaatt 900 
cccatgcctg ccctcctgct tgctttccag 960 
ctcttagcct cctggtctgc agatcaggct 1020 
ccccatcggc ctataaatag cagcccagcc 1080 
cccaccctct ctccgttccc tcttttccat 114 0 
tgcatcctag ccatacaaga ggaggaaata 1200 
aagggagaaa aagaggaaag aatagtggga 1260 
atgcaggtga accagagtgg tcaccctgtg 1320 
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gtcatgcttc ccctccccca cacccaccta tggccccctt ttcagctccc cagggcaaca 1380 
cagaggaggc tttctgacag aggctgcagg caccccaccc atcaggcccc aatcgtgcta 1440 
gcgtctgctt ggcctgacca ctgacctagt ttctcatcaa ctaccttgct gtgactttta 1500 
ggaatgtgtg acccctgacc cccaattgct gacttggctt tggtcaagac ccatcaagag 1560 
tactaaactg ctacactgca gtccccagga gttggggcca attatttgtg tgtgtgttgg 1620 
tgggggacag gtgatattgc ccctgccctg ggagtttgtc actggacaca cacaacccct 1680 
ttagtaggaa agaggatgtg aaaccttcaa cgttgtgtgg gtttggggct tcagttcctc 1740 
aagcccttat ctcgggtgtc tgcagtctat cccactctat ctgtctggcc atctgtgaac 18 00 
cctggttcct gtgtgtatct tctggtcttg actgtttatc ttgatacagg ctcagcctta 1860 
acttcttgtg tcgagggcct gtccccctta cctgccttgc cctggtttct ggctgctacc 1920 
ttgcatgtgc atgcatatat atttaagaaa gctcactaga ctgggctcgg tggctcacgc 198 0 
ctgtaatccc agcactttgg gaggctgagg cggcagatcg cgaggtcagg agatcaaggc 204 0 
catcctggct aacatggtga agcctcgtct ctactgaaaa tacaaaaaat tagccgggcc 2100 
tggtggcagg cacctgtagt cccagctact caggaggctg aggcaggaga atggcatgaa 2160 
cctgggaggc agagcttgca gtgagcagag atctcgccac tgcactccag cctgggtgac 2220 
agggcgagac tccatctcag aaaaaaaaaa aaaaggcaag ctcagccggg catggtggct 2280 
cacacctgta atcccagcac tttgggaggc cgaggtgggc agatcacctg aggtcaggag 2340 
ttcgagacca gcctgaccaa. catggtgaaa ccccgtctcc actaaaaata caaaaaataa 24 00 
taataataaa ataaagctca tatcctgaaa aacatctcat caacacagac cacaaacaat 24 60 
aataggtcat gcctaaaagt ccaaacactg gcaaacgtca ctattgccag ctcatcacat 2520 
tagtggctaa gattaaaagc tgagaagaaa aagaaaatgg tgtctttctt ctaccaaatt 2580 
agccccaatc acgtatgtcc taaatctgtg ggccccttct gtgaataagc ctctactgtg 2640 
cttttccaga aagtgtcatg ccaggtcttg ctggcttcta gtcagattgt cttgctcttc 2700 
ttgggacaca aattcatcag gctggagacc ccagtgtgtt tgttcctatc actgtgactg 2760 
gggacctggc tcagagtaca cccaggacat ggaggccaga gccacgggtt ttgggaggga 2820 
tttgctcagt tacacactgg gcgatgtgct catggatgct gaggtatgtc tctagtagta 2880 
gatacacatt tactctgact catgtagtct ttcactccag gattctgggg accaaacttg 2940 
aggaactggt ggaatttcag gctaagtctg aggcccgatt gaggtgtgaa gctcttgagg 3000 
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tcaggaactt ggtctgtggc cagggctggg gtgaagaatg gaatgaaacc aggtagcccc 3060 
aagaacatcg aggtgcgtca cccattagcc tggggctccc atggcaatgt ggctctgtca 3120 
gctctgggct gtgggtattg tgtggtgggg ggaggcgagt atgtgggccc tcgggcagcc 3180 
atttagccca ctgattgcaa cacccaccct ttttccagct ctcccttcag cgctccagca 3240 
gcttcaagga ttttgccaaa tccaaaccca gctcccccgt ggtgagcgag aaggagttta 3300 
atctggatga taacgtgagt ttcagggcat ccttgtggga tctggctgca ggccctgggc 3360 
aggggggtgg gggtgggagg gaagagggtg aagaggagat agaattgttg gggcgaagcc 3420 
ccctttaaca cagagggtcc acctctcccc acaccaagca ctcccctgca ttcctttcaa 3480 
ttatacatta aatagcaaac tatttataaa tttatagggc taaggacatg gctggggtta 3540 
agaggggagg ggatgggggc ctctcagttc caccccctcc agcttctgct gtgaccccaa 3600 
ggcctctttt gcctcaagtc ccggacctag gcaggaccac cagggtccct ggatcgcctc 3660 
ttttgttgtc acagattcca gaagatgact caggtgtccc caccccagaa gatgctggga 3720 
agagtggcaa aaagctgggg aagaagtgga gggcagtgat ttcccgaacc atgaacagga 3780 
agatgggcaa gatgatggtg aaggccctgt cagaagagat ggtgaggcct gcagatatag 3840 
gggatggggt gtccaggggg cctggggacc gctctggcag aatgtgagca tgaccacctc 3900 
aatagccact actcaggccg gaaggcccta tttgatgcaa gaaggaaggt cacatgggag 3960 
gggaaactca cttgcagcca cggacaggca gccaggcaat cttgacgggg caggaggctg 4020 
gcgggggaag gggtggagtt tgagaaggga tgaaagtctg ggcaacaggg ctggacatgg 4080 
tggctcctgc ctataatccc agcactttgg gagactgggg ccgatcactt gaggtcagga 414 0 
gtttgagacc agcctggcca acatggcaaa accccatctc tactaaaaat acaaaaatta 4200 
gccaggcgtg gtggcacatg cctgtaattc cagctactca gagactgagg caggagaatt 4260 
gcttgaactt aggagacaga ggttacagtg tgccaagatc gtgccactgc actccagcct 4320 
gggcaacaga gcaagactct gtctcaaaaa aaaaaagaaa gaaagaaaga aagaaagaaa 4380 
gaaagaaaga aagaaagaaa gaaagaaaga gaaaaaaaaa aaaaagaaaa gtctgggcaa 4440 
gggatgcttc ttgaggaggc gggcctggga caggggcagc ttgagaaaga tgaaatggcc 4500 
tgaggcaatc catccacctc aaagctttgc tttggaactt ccccgggatt gggaaggcct 4560 
attttctctc atagaatccc tatgagagaa aacagattgg gcacagtggc tcacacctgt 4620 
aatcccagca ctttgagagg ccaaggcagg tggatcacct gaggtcagga gttcgagacc 4680 
agcctggcca acatggcaaa atcccgtctc tactaaaaat accaaaacaa attagccagg 474 0 
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cgtggtggca ggtgcctgca accccagata ctcgggaggc taaggcagga gaattgcttg 4800 
aacccgggag gtggaggttt cagtgagcca agatcacacc actgcaccct ggtctgggtg 4860 
acaagagcaa aactccgcct caaaaaaaag aaaaaaaaag agagaaaaca gagaaggctg 4920 
gcttcagccc agggaggaaa gttggcacag gcagctgtgg ggcaggcatg acccaagaag 4980 
cttaaatcac acagtgggtt tgggttgcca tggctctaaa aggagccact gaggcagtgg 5040 
tgtgctggag ttggctcgta ctgggctcat accagctcac aagagcagtt ggtcaaactg 5100 
tcaaaaattg tgcaagccag ttgttaaaca caaccattat taaaaatcaa attaaagaaa 5160 
cttacaattc agttaattat atttttttaa aaaggtaaac tgggcatggt ggtgtacacc 5220 
tataatccca gctactcagg aggacgaggc aagatgattg cttgagccca gtaagagtcc 5280 
agcctgggca acatagggag actctgtctc taaataaata aatttaaaaa aataaaattt 5340 
tgtaaaaacg aacaaaggta atacagtaaa tcttcaaaac tcatcacttc ctaattcttg 5400 
cactaccttt cactcttatc aatgctgctg aggtcacgtg catccatagc atcttcatgg 5460 
cggcaacagc ctcaccctca tttcagacaa cacgtccagt gatgtcaaaa tggtagcttg 5520 
acattggccg tggaggggag cgtttacaac acaggaattg gcaactcatc agggccagct 5580 
gttaaacagt tatcacctca ctgaggtgtg gtgaggtagg gggagctctg gagtctggct 5640 
cggttgcggg ttgccttggg acttgaacct gaattccctt ggcctgagtt tacagatgag 5700 
gagcaagtaa cccttgggcc ttactcctca tttgtaaact caggccaaca ataccactta 5760 
ccttcaagtg tcactgtaag gattcagtga aatggcaggc aggcctggca cacagaaaca 5820 
caggcaaaag tagtttcctc tctagcgaca ttgtctgccc agttgtgcgg tagcaatggt 5880 
tgagtccagg ttgtgccacc gcggtctttt ctgtatgtac atcggcactc gaggctgaca 5940 
aaagcacatc cacatggtgc cctggttgaa cctgtttctg tgaggtaacc agggcaagga 6000 
ctgtcactct catttttaca aaaggggaag aaactagggc tcagagagtg ccattcagcc 6060 
tataagcggt agagttgggg tttgagccag cccctagctc ttcactaagc ccaggcccat 6120 
ttcttcagct tcatgcacac cccagcaagg ctccctaaag cttcccgcct accctccctg 6180 
caggcagaca ctctggagga gggctctgcc tccccgacat ctccagacta cagcctggac 6240 
agccctggcc ctgagaagat ggcgctggcc ttttctgagc aagaggagca tgaacttccg 6300 
gtgctcagcc gccaggcatc aacaggtgag taggggatgc gggggacacc tgccgaatct 6360 
ggaggaaagg actgggttac agccgtgctg gtggcacact gtctgggcgg ggggtaagag 6420 
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aaatgaatgc ttactctttg cctgctggga acattctcag tcctcttcac attaaatctt 6480 
ctcaacaacc tcatggggcc agtactgtta tccttgcttt attgatgtgg aaacagggtc 6540 
ttagggaggt taagtgactt gcccaagatg acccagtctt gaactcagat gtaccgtaga 66 00 
ggaacttcca cttgctatcc tcaggccccg gagaggccca cgggcagagg aagagtctca 6660 
gtcaccctct tggaccagcc cgtccccacc tttccaaggg gagctagagg ctacagatcc 6720 
ctgggcctct gatgccccag tcgggggtgg gcgtatgagg tctccagctg tgaagagtca 6780 
gcctcttgac cctagaccat agttgctggg ctcagggcat cacctggcat cttctagaag 6840 
tccatgggac aggacaagag gtgccagcag ggaactcgtc caggcagggg ctttgggttc 6900 
tgcagcccca ttcagcaccg cctttcttgc ccctcaagtc cactccaagt ttcctgcttt 6960 
ctctgcctcc acagacactt ccctgccccc gactgaaaat cattcaaaat gcccactact 7020 
ctgtacttcc cagggagaag gagcccaact gagtttctat gtgacattgt aactacacat 7080 
tgagttaagc agggaaagtc aaatatgaat gcagggcata gcttcctggg tggagggagc 7140 
gttggagtgg gtatcaggga acatggagtc taatggactg actcagtgct gccagttata 7200 
aaatgagaag ccttgagcag ttcacttggt ctgtgtgacc tcagatttcc gcatctgttc 7260 
attggagtgt tggcccagtt gttctctaga ggccctccta atttgcccat tctttggtca 7320 
gtgagccatt tctcctccca ttgccaactc ttttcatgtc caagggccca gaggtgaagc 7380 
cgtgctcctg gtcggtagcc aagggggtgg ttcagaatcc cttctgcagt gtaaagcttg 744 0 
ccagagccca gctgccttcc cggtgtatac ccttgggatt tctaggcagg tcccctggaa 7500 
ggcacctgct aggcactgtc tcagctgttc taggacaaag gtgagcctcc tccctgcccc 7560 
atctcctgca ggcagtgagc tctgcagccc cagcccaggt tctggcagct tcggggagga 7620 
accacctgcc ccccagtaca cagggccttt ctgtggccgg gcacgagtcc acaccgactt 7680 
cactcccagc ccctatgacc acgactcgct gaaactgcag gtaagatcag catccgggct 7740 
tctctggagc ctggcaggct gtgcccaaaa aggaagctgg actgaaccag gctatgacag 7800 
tgtcagtgga ggccaaggcc ccccatatcc tctctccctt gctcccttct cctctcctct 78 60 
cccccaacag aaaggagatg tgatccagat cattgaaaag ccacctgtgg gcacgtggct 7920 
gggcctactc aatggcaagg tgggctcttt caaattcatc tatgtggatg tgctgcccga 7980 
ggaggccgtg gggcatgccc gccccagccg ccgacagagc aagggcaaga ggcccaagcc 8040 
taagaccctg catgagctgc tggagcgcat cggcctggag gtttgagctt ggtcctcact 8100 
agtatctagt atcagggagg cacaactgcc ccagggatgg ggaccaggaa atgacagcta 8160 

123 



9911799A2 I > 



SUBSTITUTE SHEET (RULE 26) 



WO 99/1 1799 PCT/US98/18426 

tgcgtagttg gggaggacct aggcaggggt ggctggtaag cagctgtgcc gatggcctgc 8220 
ctctgcctac aggagcacac atccaccctc ctgctcaatg gctaccagac actggaagac 8280 
ttcaaagagc tgcgagaaac acacctcaat gagctgaaca tcatggatcc acagcaccgg 8340 
gccaagctgc tcacggccgc cgagctgctg ctggactatg acagtgagtg gctttaggag 84 00 
cggcctggtg agggtgtgtg cccaccggca ttccagggag gggaggcttg ccctggcctt 84 60 
gccttctgtc cacgctctgc cctaggactg ctctgcagtg gaaaggtact ttccacttga 8520 
attagaattt cagggaaagt gtacgggaga aaggagttgt agggatgatt gggcccaacc 8580 
tcctttggat caaaggggac ccttaaggcc aaagaaggca aagccttact tgaggcctca 8640 
aagctgagta ataacagagc caggattcaa gcccactgcc tggctccagc ttagtgctaa 8700 
agaagtgtga gctcctggac tgcagagctg gcctggaaac aactcctacc agcttctaag 8760 
ctggaagcag tgaggagagg ggcagggcgg tggcaggtgc ccagaaggag agactgccta 8820 
tggtgtattt cccaaggtcc ctacctccac ccccatattc tgtctccctc tctcgtccct 8880 
ggcagctggc agtgaggagg ctgaagaggg cgccgagagc agccaggagc cagtggcaca 8940 
cacagtgtcg gaacccaagg tggacatccc gcgcgactca ggctgctttg agggctcgga 9000 
gagcgggcgc gatgacgcag agctggcagg cactgaggag cagctgcaag gcctctccct 9060 
ggccggggca ccttgaggtg gcggtggcaa taggccaagg ctgggaccca gctgcaaagg 9120 
ctgtaggagt gggcccagcc tcccgtggtg gcccaggtcc tgaggactgg cactgagcct 9180 
ggccctgctt ccccagggac acttagggcc acagaggcca ggccagggcc ctacaggttc 9240 
caggctcagc tggagtggtt ggggagtcgc ccaagggcac atcccacctg cctgagcccc 9300 
gccctccacc agcgactgac agcgcagccc ctcctggcac caactgctcc cctgccatgg 9360 
ccacggccac agcaagtggg gcactgggaa accctgccca tgtccctcac caacaaggcc 9420 
tccaaatcct cctcaccccc acaccaccta cccctgtcgc actgctcctg aaaagggggc 9480 
caagtcaatg tttcaggtca gtctaaaaac cctagggaag ctggccattt aaaagaaccc 9540 
aaactgacca tgggtaaatc cagttcccct aaataaggcc tgaagaaatc cacaggtacc 96 00 
attcccactt tccttctccc tagctttctt agaggtttgg ccactaaatc ttatgagact 9660 
tgaaccaagt ggcttcctct ttctaggctt aggacgggtt ggggttagaa agggtgatca 9720 
ctgaaggcct tgcctgctct gacattctgt gacattaaat gtctattctc ctgttacctg 9780 
tggcctggga caccagtggg gtttatcgag gggaccagag gggcctcagg ctttcagatg 9840 
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aaatggctcc tcctactcac ccactttatt cctctccatg taattcagga caagctgcaa 9900 
cttcccccag cttaacacaa tgcccatacc tcatacgata tgcgccctcc cgttccatcc 9960 
ctggccccct caaacgagac ttctcacaag gctgattaca gatggtcaaa cctggcttcc 10020 
aaggacagaa ttgcctctcg gaagccagct gtggatctga gtccagagtt ggccacttgt 10080 
gtgggtcctc acaagcaaag agagcactaa acttgacatt gggggtccac cactccaact 10140 
ttgctttctg aaggttttgg tgtacattga gccccagaag gaaaggagag tatctgtgag 10200 
tgggggcctc ccttgacccc agtacgaagt ctatgccctg aatccccaga gtagcccttc 10260 
ctggtgccca actggcctgg ggacaaacag cgtccactac atctaggact gccggctaag 10320 
tggacacact tcttgacctc ctaccaggaa ctttggtaaa agctagcttt ggggaagggg 10380 
ttgggtgtaa atatgagagg gtggagggag accagctggt agcaataaac atgggtagaa 10440 
ctaaattacc gtctccagtt atcttttcta tggagagagt gttgtgggga ggggcagacc 10500 
ggtctccttc aaagctggcc tcagcaaagt gtccctcact gtcctttcag gtccatcttt 10560 
cccttccctt aaatgttcag tgcccttgac tctgctgacc taaagctcca gtctgaagcc 10620 
ctagctggct ctgccctccc ctctaaccag ccctcctcag aacaaggctc aagctcccat 10680 
gaccacgggc tttgctgggg tccaagaggt gtagggggga atggctattt ccctcatcca 10740 
ataactgttc attttaacag ggcccttaaa gaccttcacc cgtgtgaaga aaggcctgca 10800 
ctgaggagct gtccaggatc taagaggggg agatttgggg tcagcatggc ctttcctctg 10860 
aagtcacctt ttcctggccc ccaccctgta cccactaaag cagtgccatc tcctgggagg 10920 
taggatggag atgaagacct ctagcttcct ttctgtttct gccagaatta actgcattgg 10980 
gcattgggaa gggggttact ggagagagag ctgccaccag agtgaggacg aggccacttg 11040 
acttccaggc cttgttctca gttgcattca tcccaccacc cttagtgact gggggtgcca 11100 
ggaaactgca agcatgatcc tcaccaaaga tataagagcc ctaacagcct caaagcccca 11160 
agggtactga ataattgcag tcatttaagg agcacctcca cttgtgccag gcactgagca 11220 
ctttacatat agcatctcct gtaatcctca cagtgtcccc ctgaggtggg tgccctcatc 11280 
atcccagttt acagaggagg aaactgaggg ttggggaggt tgaagaagtt gactagaaag 11340 
taacagagca ggagtcttcc tgactgcaga acccatgttc ttaactccta tgctacaatg 11400 
cttctccaaa ggcctcatcc aacaagcatt caaaagtcct gggcactttg gagggcaagc 11460 
tgactctctg acccagccat ttcccttcta gaatttaatc agccacataa gtgcagaaac 11520 
agccacataa gtgcacaaat acctatggac aaagatatct acaacagctc tgtctgtaat 11580 
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agtaaaaaat tgaaaacaat gcaaatgggg aattaaatgc tttatggttt gttcatatac 11640 
cacgcagcca tgacacatga tgttgtagaa aaatatttaa taacatgagg aaaatctttg 11700 
tgttttttta ttctttgatt ctatttattt ttctagagac agggtttcgc tctgttgccc 11760 
aggctggagt gatgtggcac aatcatagct cactgcagcc tcgaattccc aggttcaggt 11820 
gatcctccca cctcagtttc ctgagttagc tgggactaga ggcatgcgcc accacaccca 11880 
gctaattttt gtttagtttt tgtagagatg aagtgtagtt atgttgccca gttgcccagg 11940 
ctggtctcaa actcctgggc tcaagcaatc cgcccgcctc agcctcccaa agtgctggga 12000 
ttacaggcgt gagctaccac acccagcctt caatttcaaa tttgaaaatt acacaaataa 12060 
tgtataaaca ttttagttgt aaaataaatt tgtgttatag caaggggaaa agcaaatcct 12120 
aaaacattat taataggatg attccatttt tattttagaa aaggagaaca tatacataga 12180 
ataaagacta tgaagatctg caccaaatga ctactactgg tgatcttgtg aggcgtgtgg 12240 
ggtgggacta ggggagtgag ggagaaaaat gctcactttc tacattcttc ttttatttgc 12300 
acttcaaaag tattattttt atatctatga agaaaaggaa actgcatttc aatttttgaa 12360 
aaaaaaaata aaactaaatt aacaaataaa tagggcccac gcctctggcc tccaatacgg 12420 
gacagtagca gcaaaatcat tatcctgctg ccttttggta cacggttctg gaaagatgag 12480 
aggtgaataa tggccaaagt gcttaaaatc catgcatgcc agggtgcacc tatcagagga 12540 
gattcctaaa ccttttaaaa cccctgtgta ataggtattg ctctaagctg gggaaaggaa 12600 
aagatcagaa ggagatgaat gcctgggaac atgatgtttg gaaatcctct cctaccctgg 12660 
ggaggcagct caggggtggt gtgtctcaga tatcaggccc agtttggcat ataggatgat 12720 
tacaggctgg aggtaggtag ctgtgcaagg accccaagga tcacagtagt ttgggagggg 12780 
caggtaaggc ggggcatcaa gggaatgtag aacaggccag ggcccaaggg gcaagaaagg 12840 
cagtgctgag gagggccaca gggcccaaag ctcctcctcc aggcagggct gctgacacct 12 900 
attgtcctgc cccatgtact tctagaagat gccaagtggt gccctgaatc cagcaaccaa 1296 0 
ggtctagggt gcaaagagct gattcttcac agctggagac cttacacgcc caccccagcc 13020 
tgggatgtca gcagcccaga gagctgtagc ctctagctga ctccccttcg aagggtgggg 13 08 0 
acaggctggt ctaaaagggt gatgaggcaa gctcttcttc tgcccgtggg cctctccagg 13140 
gcctggggag agcaagcaga agcttctcta gggtagttga ggcctagcac ttaggagctc 13200 
aggcccacct gagttcaagt cttggctgtg tcacttccta gctgggtcac cttagacaag 13260 
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tcatttaacc tccttgagcc tccgtttcca cgacaataat gtaggaatac tagcactagc 13320 
cccatggggc tgttatgaag atttaatgta ctagcataaa gtcctttgca cagtgtccag 13380 
cacaaaataa atgctcaagg agtgatagtg tgtctggctg cctcgatttg ctccatcctg 13440 
ctacctccat ttcactcctt tcaaggccca tctgaccttt gactcccagt tagtatagtg 13500 
aagtgaccaa aagcaaaagg ccttggagtt caaataaagt gttttcaaac tcttgctaca 13 560 
ccacttacta gttatatgac attgggtaaa gtacatacat atacatttct tttctgtgga 13620 
acaggaataa tgatctaata atacctacct cttgggggtc atgagagaaa tcaatcagat 13680 
aactcgtaca aagttcctgg cacataggac attaaatggc ggctattcga cttttttgta 13740 
aactccaagg cactgagctc aagactgggc acaaaggcca tgtcggtaca tgcttgtctg 13800 
aggactgact gatgttcaga cctgtcatat cccagacacc actgatgtcc aggtccccag 13860 
gaggtgttgg gattcagggg taaagggtgc caactctggt acccagagag gctggtgatg 13 920 
aaagaaaacc cacttgagga gcctggggca gccctggctc tgcccagtcc acagcaagtt 1398 0 
gatcagccct gggcattaga cttactttag gcaactctct tggacttcca ggagctgcta 14 040 
gaaagaagag aataaagacc agatttcaga agaaagagtt gagtctagcc agctcctcac 14100 
cagttcagag ttagagccac cttctgtgtc tccttcctat caggagaagc agctggaggc 14160 
accagcacag cccaactgaa ccatgtcctc tctcacttgc ttcctctgta tctcacaacc 14220 
tgaccttaga ctaccccagc aggttctgat ccgcccccca aaggcctctg caaattccct 14280 
ctcctaaacc ccaaacccta tggcactgtg atctgataca tggcactgag gccccagccc 14340 
ctatgagtca gttztctcatt tcctactcac agagctcccc gcccacagtt gggccatgag 14 4 00 
agacctaggc attaaaacaa aggctaaacg aggtgtttta cgggaaaaat aggcactgat 14460 
caccatcgtc tcaacaaggc ccaacaaggc ccttcccccc tcccaagtca ggaaattcag 14 520 
gatgaggtat gaggtttccc actggggaaa agaagtggct ctcccacctc tccttcacaa 14 580 
tagaggctac catagcttct cacacaacat tggccatatt acatattatt cagccaacaa 14640 
cacaatcaag tgggaaggaa ctgcctcccc tctagactca gccaagactc ctcctccagg 14700 
aaggacctaa ataaatggaa ttctatagca cttgatatgt aggcttcacc ttttcctata 14760 
acttcacagt aaccaatata gcatttgtat accattgcct ctgggacctg ccagggatag 14820 
cgtcagaaag agcccagagc aatggccctc acaaagggac agccaacttg actgtgggaa 14880 
cttagcaagg caatttcctg actttctgtg ctaaagaaac cacatcatca gttcaaccta 14 940 
aatgcagggg tctggagaga gacttaagag agttaagttg aaaacaaaaa tccaacaacc 15000 
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agcctcatat cagcccgact atatataagt cctctgtatg cctaaaatgg agagcttacc 15060 
agtaatattc accccaaatg gcaaataatc ctaaaaatca ttgctgtttc tcaacttacg 15120 
agggttggta ctgcaatttc tcctaaagtt aaccctagca tgcagtcaat cttcaacaag 15180 
tatttattga atgttaaata aatgaaagct tcatttcatt caagtgtcat ttgcagtcat 15240 
tctaactctt ttatccatct acacccgctt gacagaggac cagtcggtta tgccagaaag 15300 
ctctctagac cgggagccag tacatctggg ttctggtctt ggctctgaca gtaactctct 15360 
atgtgaccct aagttacagt atttaatttg agggagaaag agagagagag agagaagcat 15420 
agactacatg atcaatatgt tcctttaagc tatgtcgttc aacagaatct gtgttttatg 15480 
acgaataatc atcatgtctc gtatttgggc agaaaacccc aaccctttta accaagttgg 15540 
aacagctaaa cagctggaga tgggcagggg ctggcagaaa aagcagtgag aaggcaagtg 15600 
ggggagtgaa gaaagtgaag ggcttggaat gtcagggagg taattctact ctagattcta 15660 
aagttgtaaa gccacttcta aatcaagggc cccttcattt cctggcccgt ggtgaagtgt 15720 
gagctctaag ttctaattta ttaatcaatc agaaaacccc ttaatgcctg aagaaaaata 15780 
gacaaaagat taggacagac aatttacaga agaaatacaa atggtaataa acataaggca 15840 
gaggagaggt gcagaaagtg tttaacttca ctattaatca aataaatata aactaacact 15900 
gagatataag tttacaacta tcaaattacc ataggtttag gtcgggcacg gtggctcacg 15960 
cctgtaatcc cagcactttg ggaggccgag gcgggcagat cacttgaagt caggagttgg 16020 
agaccagcct ggctgacatg gcaaaacccc gcctccaata aaaatataaa aattagccag 16080 
gtgtggtggc gcacacctgt agtcccagct aatgaggagg ctgaggcagg agaatcactt 16140 
gaacccggga gggagagaat gcagtgagcc aagatcgtgc cactgcactc cagcctggat 16200 
gacaaagcga gactctgtct caaaaaagct aaaataaaaa ttcgaaaggt ttaaatgaca 16260 
tgggttgatt agagtcttct ctgatacact gctggcaaaa caaacaggga cggcctttca 16320 
acaaagcaat ttggccacat ttatcaagag cctttaaaag agcattcccc ttgactcaat 16380 
aattccactt ctaggaatct atcctaaaga aatactcaca gacacaccca tatttaagga 16440 
agttcatcac agcattattt gtaataatga aaaattagaa actcttaaat gcctgacatg 16500 
tacacattac aaagtcatta agaagtattt ttgagccagg cgcggtggct cacgcctgta 16560 
atcccagcac tttgggagtc cgaggcaggt ggatc 16595 
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